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INSECnCIDAL TOXINS FROM PHOTORHABDUS 

The invention relates to novel toxins from Photorhabdus luminescens, nucleic acid 
sequences whose expression results in said toxins, and methods of making and methods of 
using the toxins and corresponding nucleic acid sequences to control insects. 

Insect pests are a major cause of crop losses. Solely in the US, about $7.7 billion 
are lost every year due to infestation by various genera of insects. In addition to losses in 
field crops, insect pests are also a burden to vegetable and fruit growers, to producers of 
ornamental flowers, and they are a nuisance to gardeners and home owners. 

Insect pests are mainly controlled by intensive applications of chemical insecticides, 
which are active through inhibition of insect growth, prevention of insect feeding or 
reproduction, or death of the insects. Good insect control can thus be reached, but these 
chemicals can sometimes also affect other, beneficial insects. Another problem resulting 
from the wide use of chemical pesticides is the appearance of resistant insect varieties. 
This has been partially alleviated by various resistance management strategies, but there is 
an increasing need for alternative pest control agents. Biological insect control agents, 
such as Bacillus thuringiensis strains expressing insecticidal toxins like d-endotoxins, have 
also been applied with satisfactory results, offering an alternative or a complement to 
chemical insecticides. Recently, the genes coding for some of these d-endotoxins have 
been isolated and their expression in heterologous hosts have been shown to provide 
another tool for the control of economically important insect pests. In particular, the 
expression of insecticidal toxins in transgenic plants, such as Bacillus thuringiensis d- 
endotoxins, has provided efficient protection against selected insect pests, and transgenic 
plants expressing such toxins have been commercialized, allowing farmers to reduce 
applications of chemical insect control agents. Yet. even in this case, the development of 
resistance remains a possibility and only a few specific insect pests are controllable. 
Consequently, there remains a long-felt but unfulfilled need to discover new and effective 
insect control agents that provide an economic benefit to farmers and that are 
environmentally acceptable. 

The present invention addresses the need for novel insect control agents. 
Particularly needed are control agents that are targeted to economically important insect 
pests and that efficiently control insect strains resistant to existing insect control agents. 
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Furthermore, agents whose application minimizes the burden on the environment are 
desirable. 

In the search of novel insect control agents, certain classes of nematodes from the 
genera Heterorhabdus and Steinernema are of particular interest because of their 
insecticidal properties. They kill insect larvae and their offspring feed in the dead larvae. 
Indeed, the insecticidal activity is due to symbiotic bacteria living in the nematodes. These 
symbiotic bacteria are Photorhabdus in the case of Heterorhabdus and Xer)orhabdus in the 
case of Steinernema, 

The present invention is drawn to nucleic acid sequences isolated from Photorhabdus 
luminescens, and sequences substantially similar thereto, whose expression results in 
toxins that are highly toxic to economically important insect pests, particularly insect pests 
that infest plants. The invention is further drawn to the toxins resulting from the expression 
of the nucleic acid sequences, and to compositions and formulations containing the toxins, 
which are capable of inhibiting the ability of insect pests to survive, grow or reproduce, or of 
limiting insect-related damage or loss in crop plants. The invention is further drawn to a 
method of making the toxins and to methods of using the nucleic acid sequences, for 
example in microorganisms to control insects or in transgenic plants to confer insect 
resistance, and to a method of using the toxins, and compositions and formulations 
comprising the toxins, for example applying the toxins or compositions or formulations to 
insect-infested areas, or to prophylactically treat insect-susceptible areas or plants to confer 
protection or resistance to the insects. 

The novel toxins are highly active against insects. For example, a number of 
economically important insect pests, such as the Lepidopterans Plutella xylostella 
(Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Corn 
Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), Manduca 
sexta (Tobacco Hornworm), Spodoptera exigua (Beet Armyworm), and Spodoptera 
frugiperda (Fall Armyworm), as well as the Coleopterans Diabrotica virgifera virgifera 
(Western Corn Rootworm), Diabrotica undecimpunctata howardi (Southern Corn 
Rootworm), and Leptinotarsa decimlineata (Colorado Potato Beetle) can be controlled by 
one or more of the toxins. The toxins can be used in multiple insect control strategies, 
resulting in maximal efficiency with minimal impact on the environment. 

According to one aspect, the present invention provides an isolated nucleic acid 
molecule comprising: (a) a nucleotide sequence substantially similar to a nucleotide 
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sequence selected from the group consisting of: nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of SEQ ID NO:1, 
nucleotides 15,171-18,035 of SEQ ID NO:11, and nucleotides 31,393-35.838 of SEQ ID 
NO:11; (b) a nucleotide sequence comprising nucleotides 23,768-31,336 of SEQ ID NO:11; 
or (c) a nucleotide sequence isocoding with the nucleotide sequence of (a) or (b); wherein 
expression of the nucleic acid molecule results in at least one toxin that is active against 
insects. 

In one embodiment of this aspect, the nucleotide sequence is isocoding with a 
nucleotide sequence substantially similar to nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515-9269 of SEQ ID NO:1. 
Preferably, the nucleotide sequence is substantially similar to nucleotides 412-1665 of SEQ 
ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515-9269 of SEQ ID NO:1. More 
preferably, the nucleotide sequence encodes an amino acid sequence selected from the 
group consisting of SEQ ID NOs:2-6. Most preferably, the nucleotide sequence comprises 
nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 
2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515- 
9269 of SEQ ID NO:1. 

In another embodiment of this aspect, the nucleotide sequence is isocoding with a 
nucleotide sequence substantially similar to nucleotides 15,171-18,035 of SEQ ID NO:11. 
Preferably, the nucleotide sequence is substantially similar to nucleotides 15,171-18,035 of 
SEQ ID NO:11. More preferably, the nucleotide sequence encodes the amino acid 
sequence set forth in SEQ ID NO: 12. Most preferably, the nucleotide sequence comprises 
nucleotides 15,171-18,035 of SEQ ID NO:11. 

In still another embodiment of this aspect, the nucleotide sequence is isocoding with 
a nucleotide sequence substantially similar to nucleotides 31,393-35,838 of SEQ ID NO:1 1. 
Preferably, the nucleotide sequence is substantially similar to nucleotides 31.393-35,838 of 
SEQ ID NO:11. More preferably, the nucleotide sequence encodes the amino acid 
sequence set forth in SEQ ID NO:14. Most preferably, the nucleotide sequence comprises 
nucleotides 31 .393-35,838 of SEQ ID NO:1 1 . 
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In yet another embodiment of this aspect, the nucleotide sequence encodes the 
amino acid sequence set forth in SEQ ID NO: 13, and preferably comprises nucleotides 
23.768-31 ,336 of SEQ ID NO:1 1 . 

In one embodiment, the nucleotide sequence of the invention comprises the 
approximately 9.7 kb DNA fragment harbored in E. coli strain DH5a, designated as NRRL 
accession number B-21835. 

In another embodiment, the nucleotide sequence of the invention comprises the 
approximately 38 kb DNA fragment harbored in £. coli strain DH5a. designated as NRRL 
accession number B-30077. 

In still another embodiment, the nucleotide sequence of the invention comprises the 
approximately 22.2 kb DNA fragment harbored in E. coli strain DH5a. designated as NRRL 
accession number B-30078. 

According to one embodiment of the invention, the toxins resulting from expression of 
the nucleic acid molecules of the invention have activity against Lepidopteran insects. 
Preferably, according to this embodiment, the toxins have activity against Plutella xylostella 
(Diamondback Moth), Trichoplusia ni (Cabbage Looper). Ostrinia nubHalis (European Corn 
Borer). Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), 
Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall Armyworm). 

According to another embodiment of the invention, the toxins resulting from 
expression of the nucleic acid molecule of the invention have activity against Lepidopteran 
and Coleopteran insects. Preferably, according to this embodiment, the toxins have 
insecticidal activity against Plutella xylostella (Diamondback Moth). Ostrinia nubilalis 
(European Corn Borer), and Manduca sexta (Tobacco Hornworm), Diabrotica virgifera 
virgifera (Western Corn Rootworm), Diabrotica ur)decimpunctata howardi (Southern Corn 
Rootworm). and Leptinotarsa decimlineata (Colorado Potato Beetle). 

In another aspect, the present invention provides an isolated nucleic acid molecule 
comprising a 20 base pair nucleotide portion identical in sequence to a consecutive 20 base 
pair nucleotide portion of a nucleotide sequence selected from the group consisting of: 
nucleotides 412-1665 of SEQ ID NO:1. nucleotides 1686-2447 of SEQ ID NO:1. nucleotides 
2758-3318 of SEQ ID NO:1. nucleotides 3342-4118 of SEQ ID NO:1. nucleotides 4515- 
9269 of SEQ ID NO:1. nucleotides 15.171-18,035 of SEQ ID NO:11. and nucleotides 
31,393-35,838 of SEQ ID NO:1 1. wherein expression of the nucleic acid molecule results in 
at least one toxin that is active against insects. 
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In one embodiment of this aspect, the isolated nucleic acid molecule of the invention 
comprises a 20 base pair nucleotide portion identical in sequence to a consecutive 20 base 
pair nucleotide portion of nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of 
SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1. nucleotides 3342-4118 of SEQ ID 
NO:1, or nucleotides 4515-9269 of SEQ ID NO:1. 

In another embodiment of this aspect, the isolated nucleic acid molecule of the 
invention comprises a 20 base pair nucleotide portion identical in sequence to a 
consecutive 20 base pair nucleotide portion of nucleotides 15,171-18,035 of SEQ ID 
NO:11. 

In still another embodiment of this aspect, the isolated nucleic acid molecule of the 
invention comprises a 20 base pair nucleotide portion identical in sequence to a 
consecutive 20 base pair nucleotide portion of nucleotides 31,393-35,838 of SEQ ID 
NO:11. 

In a further aspect, the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence from Photorhabdus luminescens selected from the group 
consisting of: nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID 
NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, 
nucleotides 4515-9269 of SEQ ID NO:1, nucleotides 66-1898 of SEQ ID NO:11, 
nucleotides 2416-9909 of SEQ ID NO:11, the complement of nucleotides 2817-3395 of 
SEQ ID NO:11, nucleotides 9966-14,633 of SEQ ID NO:11, nucleotides 14,699-15,007 of 
SEQ ID NO:11, nucleotides 15,171-18,035 of SEQ ID NO:11, the complement of 
nucleotides 17,072-17,398 of SEQ ID NO:11, the complement of nucleotides 18,235- 
19,167 of SEQ ID NO:11, the complement of nucleotides 19,385-20,116 of SEQ ID NO:11, 
the complement of nucleotides 20,217-20,963 of SEQ ID NO:11, the complement of 
nucleotides 22,172-23,086 of SEQ ID NO:11, nucleotides 23,768-31,336 of SEQ ID NO:11, 
nucleotides 31,393-35,838 of SEQ ID NO:11, the complement of nucleotides 35,383- 
35,709 of SEQ ID NO:1 1 , the complement of nucleotides 36,032-36,661 of SEQ ID NO:1 1 , 
and the complement of nucleotides 36,654-37,781 of SEQ ID NO:1 1 . 

The present invention also provides a chimeric gene comprising a heterologous 
promoter sequence operatively linked to the nucleic acid molecule of the invention. Further, 
the present invention provides a recombinant vector comprising such a chimeric gene. Still 
further, the present invention provides a host cell comprising such a chimeric gene. A host 
cell according to this aspect of the invention may be a bacterial cell, a yeast cell, or a plant 
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cell, preferably a plant cell. Even further, the present invention provides a plant comprising 
such a plant cell. Preferably, the plant is maize. 

In yet another aspect, the present invention provides toxins produced by the 
expression of DNA molecules of the present invention. 

According to one embodiment, the toxins of the invention have activity against 
Lepidopteran insects, preferably against Plutella xylostella (Diamondback Moth), 
Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Corn Borer), Heliothis 
virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), Spodoptera exigua (Beet 
Armyworm), and Spodoptera frugiperda (Fall Armyworm). 

According to another embodiment, the toxins of the invention have activity against 
Lepidopteran and Coleopteran insects, preferably against Plutella xylostella (Diamondback 
Moth), Ostrinia nubilalis (European Corn Borer), and Manduca sexta (Tobacco Hornworm). 
Diabrotica virgifera virgifera (Western Corn Rootwonn), Diabrotica undecimpunctata 
howardi (Southern Corn Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle). 

In one embodiment, the toxins are produced by the E. coll strain designated as 
NRRL accession number B-21835. 

In another embodiment, the toxins are produced by E. coll strain designated as 
NRRL accession number B-30077. 

In still another embodiment, the toxins are produced by E. coll strain designated as 
NRRL accession number B-30078, 

In one embodiment, a toxin of the invention comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:2-6. 

In another embodiment, a toxin of the invention comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:12-14. 

The present invention also provides a composition comprising an insecticidally 
effective amount of a toxin according to the invention. 

In another aspect, the present invention provides a method of producing a toxin that is 
active against insects, comprising: (a) obtaining a host cell comprising a chimeric gene, 
which itself comprises a heterologous promoter sequence operatively linked to the nucleic 
acid molecule of the invention; and (b) expressing the nucleic acid molecule in the cell, 
which results in at least one toxin that is active against insects. 
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In a further aspect, the present invention provides a method of producing an insect- 
resistant plant, comprising introducing a nucleic acid molecule of the invention into the 
plant, wherein the nucleic acid molecule is expressible in the plant in an effective amount to 
control insects. According to one embodiment, the insects are Lepidopteran insects, 
preferably selected from the group consisting of: Plutella xylostella (Diamondback Moth), 
Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Corn Borer), Heliothis 
virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), Spodoptera exigua (Beet 
Armyworm), and Spodoptera frugiperda (Fall Armyworm). According to another 
embodiment, the insects are Lepidopteran and Coleopteran insects, preferably selected 
from the group consisting of: Plutella xylostella (Diamondback Moth), Ostrinia nubilalis 
(European Corn Borer), and Manduca sexta (Tobacco Homworm), Diabrotica virgifera 
virgifera (Western Corn Rootworm), Diabrotica undecimpunctata howardi (Southern Com 
Rootworm), and Leptinotarsa. decimlineata (Colorado Potato Beetle). 

In still a further aspect, the present invention provides a method of controlling insects 
comprising delivering to the insects an effective amount of a toxin according to the present 
invention. According to one embodiment, the insects are Lepidopteran insects, preferably 
selected from the group consisting of: Plutella xylostella (Diamondback Moth), Trichoplusia 
ni (Cabbage Looper). Ostrinia nubilalis (European Com Borer), Heliothis virescens 
(Tobacco Budworm), Helicoverpa zea (Corn Earworm), Spodoptera exigua (Beet 
Armyworm), and Spodoptera frugiperda (Fall Armyworm). According to another 
embodiment, the insects are Lepidopteran and Coleopteran insects, preferably selected 
from the group consisting of: Plutella xylostella (Diamondback Moth), Ostrinia nubilalis 
(European Corn Borer), and Manduca sexta (Tobacco Homworm). Diabrotica virgifera 
virgifera (Western Corn Rootworm), Diabrotica undecimpunctata howardi (Southern Com 
Rootworm). and Leptinotarsa decimlineata (Colorado Potato Beetle). Preferably, the toxin 
is delivered to the insects orally. 

Yet another aspect of the present invention is the provision of a method for 
mutagenizing a nucleic acid molecule according to the present invention, wherein the 
nucleic acid molecule has been cleaved into population of double-stranded random 
fragments of a desired size, comprising: (a) adding to the population of double-stranded 
random fragments one or more single- or double-stranded oligonucleotides, wherein the 
oligonucleotides each comprise an area of identity and an area of heterology to a double- 
stranded template polynucleotide; (b) denaturing the resultant mixture of double-stranded 
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random fragments and oligonucleotides into single-stranded fragments; (c) incubating the 
resultant population of single-stranded fragments with a polymerase under conditions which 
result in the annealing of the single-stranded fragments at the areas of identity to form pairs 
of annealed fragments, the areas of identity being sufficient for one member of a pair to 
prime replication of the other, thereby forming a mutagenized double-stranded 
polynucleotide; and (d) repeating the second and third steps for at least two further cycles, 
wherein the resultant mixture in the second step of a further cycle includes the mutagenized 
double-stranded polynucleotide from the third step of the previous cycle, and wherein the 
further cycle forms a further mutagenized double-stranded polynucleotide. 

Other aspects and advantages of the present invention will become apparent to those 
skilled in the art from a study of the following description of the invention and non-limiting 
examples. 

DEFINITIONS 

"Activity" of the toxins of the invention is meant that the toxins function as orally 
active insect control agents, have a toxic effect, or are able to disrupt or deter insect 
feeding, which may or may not cause death of the insect. When a toxin of the invention is 
delivered to the insect, the result is typically death of the insect, or the insect does not feed 
upon the source that makes the toxin available to the insect. 

"Associated with / operatively linked" refer to two nucleic acid sequences that are 
related physically or functionally. For example, a promoter or regulatory DNA sequence is 
said to be "associated with" a DNA sequence that codes for an RNA or a protein if the two 
sequences are operatively linked, or situated such that the regulator DNA sequence will 
affect the expression level of the coding or structural DNA sequence. 

A "chimeric gene" is a recombinant nucleic acid sequence in which a promoter or 
regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid 
sequence that codes for an mRNA or which is expressed as a protein, such that the 
regulator nucleic acid sequence is able to regulate transcription or expression of the 
associated nucleic acid sequence. The regulator nucleic acid sequence of the chimeric 
gene is not normally operatively linked to the associated nucleic acid sequence as found in 
nature. 
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A "coding sequence" is a nucleic acid sequence that is transcribed into RNA such as 
mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA. Preferably the RNA is then 
translated in an organism to produce a protein. 

To "control" insects means to inhibit, through a toxic effect, the ability of insect pests 
to survive, grow, feed, and/or reproduce, or to limit insect-related damage or loss in crop 
plants. To "control" insects may or may not mean killing the insects, although it preferably 
means killing the insects. 

To "deliver" a toxin means that the toxin comes in contact with an insect, resulting in 
toxic effect and control of the insect. The toxin can be delivered in many recognized ways, 
e.g., orally by ingestion by the insect or by contact with the insect via transgenic plant 
expression, formulated protein composition(s), sprayable protein composition(s). a bait 
matrix, or any other art-recognized toxin delivery system. 

"Expression cassette" as used herein means a nucleic acid sequence capable of 
directing expression of a particular nucleotide sequence in an appropriate host cell, 
comprising a promoter operably linked to the nucleotide sequence of interest which is 
operably linked to termination signals. It also typically comprises sequences required for 
proper translation of the nucleotide sequence. The expression cassette comprising the 
nucleotide sequence of interest may be chimeric, meaning that at least one of its 
components is heterologous with respect to at least one of its other components. The 
expression cassette may also be one which is naturally occurring but has been obtained in 
a recombinant form useful for heterologous expression. Typically, however, the expression 
cassette is heterologous with respect to the host, i.e., the particular nucleic acid sequence 
of the expression cassette does not occur naturally in the host cell and must have been 
introduced into the host cell or an ancestor of the host cell by a transformation event. The 
expression of the nucleotide sequence in the expression cassette may be under the control 
of a constitutive promoter or of an inducible promoter which initiates transcription only when 
the host cell is exposed to some particular external stimulus. In the case of a multicellular 
organism, such as a plant, the promoter can also be specific to a particular tissue, or organ, 
or stage of development. 

A "gene" is a defined region that is located within a genome and that, besides the 
aforementioned coding nucleic acid sequence, comprises other, primarily regulatory, nucleic 
acid sequences responsible for the control of the expression, that is to say the transcription 
and translation, of the coding portion. A gene may also comprise other 5* and 3' 
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untranslated sequences and termination sequences. Further elements that may be present 
are, for example, introns. 

"Gene of interest" refers to any gene which, when transferred to a plant, confers upon 
the plant a desired characteristic such as antibiotic resistance, virus resistance, insect 
resistance, disease resistance, or resistance to other pests, herbicide tolerance, improved 
nutritional value, improved performance in an industrial process or altered reproductive 
capability. The "gene of interest" may also be one that is transferred to plants for the 
production of commercially valuable enzymes or metabolites in the plant. 

A "heterologous" nucleic acid sequence is a nucleic acid sequence not naturally 
associated with a host cell into which it is introduced, including non-naturally occurring 
multiple copies of a naturally occurring nucleic acid sequence. 

A "homologous" nucleic acid sequence is a nucleic acid sequence naturally 
associated with a host cell into which it is introduced. 

"Homologous recombination" is the reciprocal exchange of nucleic acid fragments 
between homologous nucleic acid molecules. 

"Insecticidal" is defined as a toxic biological activity capable of controlling insects, 
preferably by killing them. 

A nucleic acid sequence is "isocoding with" a reference nucleic acid sequence when 
the nucleic acid sequence encodes a polypeptide having the same amino acid sequence as 
the polypeptide encoded by the reference nucleic acid sequence. 

An "isolated" nucleic acid molecule or an isolated enzyme is a nucleic acid molecule 
or enzyme that, by the hand of man, exists apart from its native environment and is 
therefore not a product of nature. An isolated nucleic acid molecule or enzyme may exist in 
a purified form or may exist in a non-native environment such as, for example, a 
recombinant host cell. 

A "nucleic acid molecule" or "nucleic acid sequence" is a linear segment of single- or 
double-stranded DNA or RNA that can be isolated from any source. In the context of the 
present invention, the nucleic acid molecule is preferably a segment of DNA. 

"ORR means open reading frame. 

A ''plant" is any plant at any stage of development, particularly a seed plant. 

A "plant cell" is a structural and physiological unit of a plant, comprising a protoplast 
and a cell wall. The plant cell may be in form of an isolated single cell or a cultured cell, or 
as a part of higher organized unit such as. for example, plant tissue, a plant organ, or a 
whole plant. 
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"Plant cell culture" means cultures of plant units such as, for example, protoplasts, 
cell culture cells, cells in plant tissues, pollen, pollen tubes, ovules, embryo sacs, zygotes 
and embryos at various stages of development. 

"Plant material" refers to leaves, stems, roots, flowers or flower parts, fruits, pollen, 
egg cells, zygotes, seeds, cuttings, cell or tissue cultures, or any other part or product of a 
plant. 

A "plant organ" is a distinct and visibly structured and differentiated part of a plant 
such as a root, stem. leaf, flower bud, or embryo. 

"Plant tissue" as used herein means a group of plant cells organized into a structural 
and functional unit. Any tissue of a plant in planta or In culture is included. This term 
includes, but is not limited to, whole plants, plant organs, plant seeds, tissue culture and 
any groups of plant cells organized into structural and/or functional units. The use of this 
term in conjunction with, or in the absence of, any specific type of plant tissue as listed 
above or otherwise embraced by this definition is not intended to be exclusive of any other 
type of plant tissue. 

A "promoter" is an untranslated DNA sequence upstream of the coding region that 
contains the binding site for RNA polymerase II and initiates transcription of the DNA. The 
promoter region may also include other elements that act as regulators of gene expression. 

A "protoplast" is an isolated plant ceil without a cell wall or with only parts of the cell 

wall. 

"Regulatory elements" refer to sequences involved in controlling the expression of a 
nucleotide sequence. Regulatory elements comprise a promoter operably linked to the 
nucleotide sequence of interest and termination signals. They also typically encompass 
sequences required for proper translation of the nucleotide sequence. 

In its broadest sense, the term "substantially similar", when used herein with respect 
to a nucleotide sequence, means a nucleotide sequence corresponding to a reference 
nucleotide sequence, wherein the corresponding sequence encodes a polypeptide having 
substantially the same structure and function as the polypeptide encoded by the reference 
nucleotide sequence, e.g. where only changes in amino acids not affecting the polypeptide 
function occur. Desirably the substantially similar nucleotide sequence encodes the 
polypeptide encoded by the reference nucleotide sequence. The percentage of identity 
between the substantially similar nucleotide sequence and the reference nucleotide 
sequence desirably is at least 80%. more desirably at least 85%. preferably at least 90%, 
more preferably at least 95%, still more preferably at least 99%. A nucleotide sequence 
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"substantially similar" to reference nucleotide sequence hybridizes to the reference 
nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 
50^C with washing in 2X SSC, 0.1% SDS at 50°C, more desirably in 7% sodium dodecyl 
sulfate (SDS), 0,5 M NaP04, 1 mM EDTA at 50°C with washing in IX SSC. 0.1% SDS at 
50^C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 
50°C with washing in 0.5X SSC, 0.1% SDS at 50X, preferably in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50*'C with washing in 0.1 X SSC, 0.1% SDS at 
50X, more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 
50°C with washing in 0.1 X SSC, 0.1% SDS at 65°C. 

"Synthetic" refers to a nucleotide sequence comprising structural characters that are 
not present in the natural sequence. For example, an artificial sequence that resembles 
more closely the G+C content and the normal codon distribution of dicot and/or monocot 
genes is said to be synthetic. 

'Transformation" is a process for introducing heterologous nucleic acid into a host 
cell or organism. In particular, "transformation" means the stable integration of a DNA 
molecule into the genome of an organism of interest. 

'Transformed / transgenic / recombinanf refer to a host organism such as a 
bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. 
The nucleic acid molecule can be stably integrated into the genome of the host or the 
nucleic acid molecule can also be present as an extrachromosomal molecule. Such an 
extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants 
are understood to encompass not only the end product of a transformation process, but 
also transgenic progeny thereof. A "non-transformed", "non-transgenic", or "non- 
recombinanf host refers to a wild-type organism, e.g., a bacterium or plant, which does not 
contain the heterologous nucleic acid molecule. 

Nucleotides are indicated by their bases by the following standard abbreviations: 
adenine (A), cytosine (C). thymine (T). and guanine (G). Amino acids are likewise indicated 
by the following standard abbreviations: alanine (Ala; A), arginine (Arg; R). asparagine (Asn; 
N). aspartic acid (Asp; D), cysteine (Cys; C). glutamine (Gin; Q). glutamic acid (Glu; E). 
glycine (Gly; G), histidine (His; H). isoleucine (He; I), leucine (Leu; L). lysine (Lys; K), 
methionine (Met; M), phenylalanine (Phe; F). proline (Pro; P). serine (Ser; S), threonine 
(Thr; T). tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V). Furthermore. (Xaa; X) 
represents any amino acid. 
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BRIEF DESCRIPTION OF THE SEQUENCES IN THE SEQUENCE LISTING 

SEQ ID NO:1 is the sequence of the approximately 9.7 kb DNA fragment comprised in 
pCIB9359-7 which comprises the following ORFs at the specified nucleotide positions: 

Name Start End 

orf1 412 1665 

orf2 1686 2447 

orfS 2758 3318 

orf4 3342 4118 

oris 4515 9269 

SEQ ID NO:2 is the sequence of the -46.4 kDa protein encoded by orfl of SEQ ID NO:1 . 
SEQ ID NO:3 is the sequence of the --28.1 kDa protein encoded by orf2 of SEQ ID NO:1 . 
SEQ ID NO:4 is the sequence of the --20.7 kDa protein encoded by orf3 of SEQ ID NO:1 . 
SEQ ID NO:5 is the sequence of the --28.7 kDa protein encoded by orf4 of SEQ ID NO:1. 
SEQ ID NO:6 Is the sequence of the --176 kDa protein encoded by orf5 of SEQ ID NO:1 . 
SEQ ID NOs:7-10 are oligonucleotides. 

SEQ ID NO:11 is the sequence of the approximately 38 kb DNA fragment comprised in 
pNOV2400, which comprises the following ORFs at the specified nucleotide positions 
(descending numbers and "C" indicates that the ORF is on the complementary strand): 



Name 


Start 


End 




orf7 


66 


1898 


(P 


hph3 


2416 


9909 




orf18 


3395 


2817 


C 


orf4 


9966 


14,633 




orf19 


14,699 


15,007 




OrfS 


15,171 


18,035 




orf22 


17,398 


17,072 


C 


OrflO 


19,167 


18,235 


C 


0rf14 


20,116 


19.385 


C 


orfl 3 


20,963 


20,217 


C 


Orf11 


23.086 


22,172 


C 


hph2 


23,768 


31,336 




orf2 


31 ,393 


35.838 





(partial sequence) 
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orf21 35,709 35,383 C 

orf16 36.661 36,032 C 

orf8 37.781 36.654 C 

SEQ ID NO:11 also includes the following restriction sites, some of which are used in the 
subcloning steps set forth in Example 17: 



Restriction Site 


Nucleotide Position(s) 


AcdW 


2835 


BamH\ 


18,915 


BsmB\ 


11,350 


Bst\ ^07\ 


29.684 


Eag\ 


13,590; 31,481 


Eco72^ 


34,474 


H/lli i\ 

nniui 


OAAA' Rlltf?- Q'a07» Oft or\A 


Non 


13,589 


Pad 


9915; 23,353; 37,888 


Pvu\ 


8816 


Sap\ 


35,248 


SexAl 


28,946 


Sgfl 


8815 


Spel 


2157; 3769; 7831; 11,168 


Sphl 


755 


StiA 


35,690 


Tth^^^\ 


21,443 



SEQ ID NO:12 is the sequence of the protein encoded by orfS of SEQ ID NO:1 1 . 
SEQ ID NO:13 is the sequence of the protein encoded by hph2 of SEQ ID NO:1 1 . 
SEQ ID NO;14 is the sequence of the protein encoded by orf2 of SEQ ID NO:1 1 . 
SEQ ID NOs:15-22 are oligonucleotides. 

DEPOSITS 

The following material has been deposited with the Agricultural Research Sen/ice. 
Patent Culture Collection (NRRL), 1815 North University Street, Peoria, Illinois 61604, under 
the terms of the Budapest Treaty on the International Recognition of the Deposit of 
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Microorganisms for the Purposes of Patent Procedure. All restrictions on the availability of 
the deposited material will be irrevocably removed upon the granting of a patent. 



Clone Accession Number Date of Deposit 

pCIB9359-7 NRRL B-21 835 September 17,1 997 

pNOV2400 NRRL B-30077 December 3, 1 998 

pNOV1 001 NRRL B-30078 December 3, 1 998 



Novel Nucleic Acid Seouences whose Expression Results in Insecticidal Toxins 

This invention relates to nucleic acid sequences whose expression results in novel 
toxins, and to the making and using of the toxins to control insect pests. The nucleic acid 
sequences are derived from Photorhabdus luminescens, a member of the 
Enterobacteriaceae family. P. luminescens is a symbiotic bacterium of nematodes of the 
genus Heterorhabditis, The nematodes colonize insect larva, kill them, and their offspring 
feed on the dead larvae. The insecticidal activity is actually produced by the symbiotic P. 
luminescens bacteria. The inventors are the first to isolate the nucleic acid sequences of 
the present invention from P. luminescens (ATCC strain number 29999). The expression of 
the nucleic acid sequences of the present invention results in toxins that can be used to 
control Lepidopteran insects such as Plutella xylostella (Diamondback Moth), Trichoplusia ni 
(Cabbage Looper), Ostrinia nubilalis (European Corn Borer), Heliothis virescens (Tobacco 
Budworm), Helicoverpa zea (Corn Earworm), Manduca sexta (Tobacco Hornworm), 
Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall Armyworm). as well 
as Coleopteran insects such as Diabrotica virgifera virgifera (Western Corn Rootworm), 
DIabrotica undecimpunctata howardi (Southern Corn Rootworm), Diabrotica longicomis 
barberi (Northern Corn Rootworm). and Leptinotarsa decimlineata (Colorado Potato 
Beetle). 

In one preferred embodiment, the invention encompasses an isolated nucleic acid 
molecule comprising a nucleotide sequence substantially similar to the approximately 9.7 kb 
nucleic acid sequence set forth in SEQ ID NO:1. whose expression results in insect control 
activity (further illustrated in Examples 1-11). Five open reading frames (ORFs) are present 
in the nucleic acid sequence set forth in SEQ ID NO:1 . coding for proteins of predicted sizes 
45 kDa, 28 kDa, 21 kDA, 29 kDa. and 176 kDa. The five ORFs are arranged in an operon- 
like structure. When expressed in a heterologous host, the - 9.7 kb DNA fragment from P. 
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luminescens results in insect control activity against Lepidopterans such as Plutella 
xylostella (Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis 
(European Corn Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn 
Earworm), Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall 
Arnnyworm), showing that expression of the - 9.7 kb nucleotide sequence set forth in SEQ 
ID NO:1 is necessary and sufficient for such insect control activity. In a preferred 
embodiment, the invention encompasses a DNA molecule, whose expression results in an 
insecticidal toxin, which is deposited in the E. coli strain pCIB9359-7 (NRRL accession 
number B-21835). 

In another preferred embodiment, the invention encompasses an isolated nucleic 
acid molecule comprising a nucleotide sequence substantially similar to the approximately 
38 kb nucleic acid fragment set forth in SEQ ID NO: 11 and deposited in the E. co// strain 
pNOV2400 (NRRL accession number B-30077), whose expression results in insect control 
activity (see Examples 12-18), In a more preferred embodiment, the invention encompasses 
an isolated nucleic acid molecule comprising a nucleotide sequence substantially similar to 
the - 22 kb DNA fragment deposited in the E. coli strain pNOVIOOl (NRRL accession 
number B-30078). whose expression results in insect control activity. In a most preferred 
embodiment, the invention encompasses isolated nucleic acid molecules comprising 
nucleotide sequences substantially similar to the three ORFs corresponding to nucleotides 
23,768-31,336 (hph2), 31,393-35,838 (orf2). and 15,171-18,035 (orf5) of the DNA fragment 
set forth in SEQ ID NO:1 1 , as well as the proteins encoded thereby. When co-expressed in 
a heterologous host, these three ORFs result in insect control activity against Lepidopterans 
such as Plutella xylostella (Diamondback Moth), Ostrinia nubilalis (European Corn Borer), 
and Manduca sexta (Tobacco Hornworm), as well as against Coleopterans such as 
Diabrotica virgifera virgifera (Western Com Rootworm). Diabrotica undecimpunctata 
howardi (Southern Corn Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle), showing that co-expression of these three ORFs (hph2, orf2. and orfS) is necessary 
and sufficient for such insect control activity. 

The present invention also encompasses recombinant vectors comprising the nucleic 
acid sequences of this invention. In such vectors, the nucleic acid sequences are preferably 
comprised in expression cassettes comprising regulatory elements for expression of the 
nucleotide sequences in a host cell capable of expressing the nucleotide sequences. Such 
regulatory elements usually comprise promoter and termination signals and preferably also 
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comprise elements allowing efficient translation of polypeptides encoded by the nucleic 
acid sequences of the present invention. Vectors comprising the nucleic acid sequences 
are usually capable of replication in particular host cells, preferably as extrachromosomal 
molecules, and are therefore used to amplify the nucleic acid sequences of this invention in 
the host cells. In one embodiment, host cells for such vectors are microorganisms, such as 
bacteria, in particular E.coli. In another embodiment, host cells for such recombinant vectors 
are endophytes or epiphytes. A preferred host cell for such vectors is a eukaryotic cell, 
such as a yeast, a plant cell, or an insect cell. Plant cells such as maize cells are most 
preferred host cells. In another preferred embodiment, such vectors are viral vectors and 
are used for replication of the nucleotide sequences in particular host cells, e.g. insect cells 
or plant cells. Recombinant vectors are also used for transformation of the nucleotide 
sequences of this invention into host cells, whereby the nucleotide sequences are stably 
integrated into the DNA of such host cells. In one, such host cells are prokaryotic cells. In a 
preferred embodiment, such host cells are eukaryotic cells, such as yeast cells, insect cells, 
or plant cells. In a most preferred embodiment, the host cells are plant cells, such as maize 
cells. 

In preferred embodiments, the insecticidal toxins of the invention each comprise at 
least one polypeptide encoded by a nucleotide sequence of the invention. In another 
preferred embodiment, the insecticidal toxins are produced from a purified strain of P. 
luminescens, such the strain with ATTC accession number 29999. The toxins of the 
present invention have insect control activity when tested against insect pests in bioassays; 
and these properties of the insecticidal toxins are further illustrated in Examples 1-18. The 
insecticidal toxins desribed in the present invention are further characterized in that their 
molecular weights are larger than 6,000, as found by size fractionation experiments. The 
insecticidal toxins retain full insectidical activity after being stored at 4°C for 2 weeks. One 
is also shown to retain its full insecticidal activity after being freeze-dried and stored at 22'*C 
for 2 weeks. However, the insecticidal toxins of the invention lose their insecticidal activity 
after incubation for 5 minutes at 100'*C. 

In further embodiments, the nucleotide sequences of the invention can be modified 
by incorporation of random mutations in a technique known as in-vitro recombination or 
DNA shuffling. This technique is described in Stemmer et al., Nature 370: 389-391 (1994) 
and US Patent 5,605,793, which are incorporated herein by reference. Millions of mutant 
copies of a nucleotide sequence are produced based on an original nucleotide sequence of 
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this invention and variants with improved properties, such as increased insecticidal activity, 
enhanced stability, or different specificity or range of target insect pests are recovered. The 
method encompasses forming a mutagenized double-stranded polynucleotide from a 
template double-stranded polynucleotide comprising a nucleotide sequence of this 
invention, wherein the template double-stranded polynucleotide has been cleaved into 
double-stranded-random fragments of a desired size, and comprises the steps of adding to 
the resultant population of double-stranded random fragments one or more single or 
double-stranded oligonucleotides, wherein said oligonucleotides comprise an area of 
identity and an area of heterology to the double-stranded template polynucleotide; 
denaturing the resultant mixture of double-stranded random fragments and oligonucleotides 
into single-stranded fragments; incubating the resultant population of single-stranded 
fragments with a polymerase under conditions which result in the annealing of said single- 
stranded fragments at said areas of identity to form pairs of annealed fragments, said areas 
of identity being sufficient for one member of a pair to prime replication of the other, thereby 
forming a mutagenized double-stranded polynucleotide; and repeating the second and third 
steps for at least two further cycles, wherein the resultant mixture in the second step of a 
further cycle includes the mutagenized double-stranded polynucleotide from the third step 
of the previous cycle, and the further cycle forms a further mutagenized double-stranded 
polynucleotide. In a preferred embodiment, the concentration of a single species of double- 
stranded random fragment in the population of double-stranded random fragments is less 
than 1% by weight of the total DNA. In a further preferred embodiment, the template 
double-stranded polynucleotide comprises at least about 100 species of polynucleotides. In 
another preferred embodiment, the size of the double-stranded random fragments is from 
about 5 bp to 5 kb. In a further preferred embodiment, the fourth step of the method 
comprises repeating the second and the third steps for at least 10 cycles. 

Expression of the Nucleotide Sequences in Heterologous Microbial Hosts 

As biological insect control agents, the insecticidal toxins are produced by expression 
of the nucleotide sequences in heterologous host cells capable of expressing the nucleotide 
sequences. In a first embodiment, P. luminescens cells comprising modifications of at least 
one nucleotide sequence of this invention at its chromosomal location are described. Such 
modifications encompass mutations or deletions of existing regulatory elements, thus 
leading to altered expression of the nucleotide sequence, or the incorporation of new 
regulatory elements controlling the expression of the nucleotide sequence. In another 
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embodiment, additional copies of one or more of the nucleotide sequences are added to P. 
luminescens cells either by insertion into the chromosome or by introduction of 
extrachromosomally replicating molecules containing the nucleotide sequences. 

In another embodiment, at least one of the nucleotide sequences of the invention is 
inserted into an appropriate expression cassette, comprising a promoter and termination 
signals. Expression of the nucleotide sequence is constitutive, or an inducible promoter 
responding to various types of stimuli to initiate transcription is used. In a preferred 
embodiment, the cell in which the toxin is expressed is a microorganism, such as a virus, a 
bacteria, or a fungus. In a preferred embodiment, a virus, such as a baculovirus, contains a 
nucleotide sequence of the invention in its genome and expresses large amounts of the 
corresponding insecticidal toxin after infection of appropriate eukaryotic cells that are 
suitable for virus replication and expression of the nucleotide sequence. The insecticidal 
toxin thus produced is used as an insecticidal agent. Alternatively, baculoviruses 
engineered to include the nucleotide sequence are used to infect insects in-vivo and kill 
them either by expression of the insecticidal toxin or by a combination of viral infection and 
expression of the insecticidal toxin. 

Bacterial cells are also hosts for the expression of the nucleotide sequences of the 
invention. In a preferred embodiment, non-pathogenic symbiotic bacteria, which are able to 
live and replicate within plant tissues, so-called endophytes, or non-pathogenic symbiotic 
bacteria, which are capable of colonizing the phyllosphere or the rhizosphere, so-called 
epiphytes, are used. Such bacteria include bacteria of the genera Agrobacterium, 
Alcaligenes, Azospirillum, Azotobacter, Bacillus, Clavibacter, Enterobacter, Erwinia, 
Flavobacter, Klebsiella, Pseudomonas, Rhizobium, Serratia, Streptomyces and 
Xanthomonas. Symbiotic fungi, such as Trichoderma and Gliocladium are also possible 
hosts for expression of the inventive nucleotide sequences for the same purpose. 

Techniques for these genetic manipulations are specific for the different available 
hosts and are known in the art. For example, the expression vectors pKK223-3 and 
pKK223-2 can be used to express heterologous genes in E. coil, either in transcriptional or 
translational fusion, behind the tac or trc promoter. For the expression of operons encoding 
multiple ORFs, the simplest procedure is to insert the operon into a vector such as pKK223- 
3 in transcriptional fusion, allowing the cognate ribosome binding site of the heterologous 
genes to be used. Techniques for overexpression in gram-positive species such as Bacillus 
are also known in the art and can be used in the context of this invention (Quax at al. In.: 
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Industrial Microorganisms: Basic and Applied Molecular Genetics, Eds. Baltz et aL, 
American Society for Microbiology, Washington (1993)). Alternate systems for 
overexpression rely for example, on yeast vectors and include the use of Pichia, 
Saccharomyces and Kluyveromyces (Sreekrishna, In: Industrial microorganisms: basic and 
applied molecular genetics, Baltz, Hegeman, and Skatrud ecfs., American Society for 
Microbiology, Washington (1993); Dequin & Barre, Biotechnology 12:173-177 (1994); van 
den Berg et al., Biotechnology 8:135-139 (1990)). 

In another preferred embodiment, at least one of the described nucleotide 
sequences is transferred to and expressed in Pseudomonas fluorescens strain CGA267356 
(described in the published application EU 0 472 494 and in WO 94/01561) which has 
biocontrol characteristics. In another preferred embodiment, a nucleotide sequence of the 
invention is transferred to Pseudomonas aureofaciens strain 30-84 which also has 
biocontrol characteristics. Expression in heterologous biocontrol strains requires the 
selection of vectors appropriate for replication in the chosen host and a suitable choice of 
promoter. Techniques are well known in the art for expression in gram-negative and gram- 
positive bacteria and fungi. 

Expression of the Nucleotide Sequences in Plant Tissue 

In a particularly preferred embodiment, at least one of the insecticidal toxins of the 
invention is expressed in a higher organism, e.g., a plant. In this case, transgenic plants 
expressing effective amounts of the toxins protect themselves from insect pests. When the 
insect starts feeding on such a transgenic plant, it also ingests the expressed toxins. This 
will deter the insect from further biting into the plant tissue or may even harm or kill the 
insect. A nucleotide sequence of the present invention is inserted into an expression 
cassette, which is then preferably stably integrated in the genome of said plant. In another 
preferred embodiment, the nucleotide sequence is included in a non-pathogenic self- 
replicating virus. Plants transformed in accordance with the present invention may be 
monocots or dicots and include, but are not limited to, maize, wheat, bariey, rye, sweet 
potato, bean, pea, chicory, lettuce, cabbage, cauliflower, broccoli, turnip, radish, spinach, 
asparagus, onion, gariic, pepper, celery, squash, pumpkin, hemp, zucchini, apple, pear, 
quince, melon, plum, cherry, peach, nectarine, apricot, strawberry, grape, raspberry, 
blackberry, pineapple, avocado, papaya, mango, banana, soybean, tomato, sorghum, 
sugarcane, sugarbeet, sunflower, rapeseed, clover, tobacco, carrot, cotton, alfalfa, rice, 
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potato, eggplant, cucumber, Arabidopsis, and woody plants such as coniferous and 
deciduous trees. 

Once a desired nucleotide sequence has been transformed into a particular plant 
species, it may be propagated in that species or moved into other varieties of the same 
species, particularly including commercial varieties, using traditional breeding techniques. 

A nucleotide sequence of this invention is preferably expressed in transgenic plants, 
thus causing the biosynthesis of the corresponding toxin in the transgenic plants. In this 
way, transgenic plants with enhanced resistance to insects are generated. For their 
expression in transgenic plants, the nucleotide sequences of the invention may require 
modification and optimization. Although in many cases genes from microbial organisms can 
be expressed in plants at high levels without modification, low expression in transgenic 
plants may result from microbial nucleotide sequences having codons that are not preferred 
in plants. It is known in the art that all organisms have specific preferences for codon 
usage, and the codons of the nucleotide sequences described in this invention can be 
changed to conform with plant preferences, while maintaining the amino acids encoded 
thereby. Furthermore, high expression in plants is best achieved from coding sequences 
that have at least 35% about GC content, preferably more than about 45%, more preferably 
more than about 50%, and most preferably more than about 60%. Microbial nucleotide 
sequences which have low GC contents may express poorly in plants due to the existence 
of ATTTA motifs which may destabilize messages, and AATAAA motifs which may cause 
inappropriate polyadenylation. Although preferred gene sequences may be adequately 
expressed in both monocotyledonous and dicotyledonous plant species, sequences can be 
modified to account for the specific codon preferences and GC content preferences of 
monocotyledons or dicotyledons as these preferences have been shown to differ (Murray et 
al. Nucl. Acids Res. 17: 477-498 (1989)). In addition, the nucleotide sequences are 
screened for the existence of illegitimate splice sites that may cause message truncation. 
All changes required to be made within the nucleotide sequences such as those described 
above are made using well known techniques of site directed mutagenesis, PCR, and 
synthetic gene construction using the methods described in the published patent 
applications EP 0 385 962 (to Monsanto), EP 0 359 472 (to Lubrizol. and WO 93/07278 (to 
Ciba-Geigy). 

For efficient initiation of translation, sequences adjacent to the initiating methionine 
may require modification. For example, they can be modified by the inclusion of sequences 
known to be effective in plants. Joshi has suggested an appropriate consensus for plants 
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(NAR 15: 6643-6653 (1987)) and Clontech suggests a further consensus translation 
initiator (1993/1994 catalog, page 210). These consensuses are suitable for use with the 
nucleotide sequences of this invention. The sequences are incorporated into constructions 
comprising the nucleotide sequences, up to and including the ATG (whilst leaving the 
second amino acid unmodified), or alternatively up to and including the GTC subsequent to 
the ATG (with the possibility of modifying the second amino acid of the transgene). 

Expression of the nucleotide sequences in transgenic plants is driven by promoters 
shown to be functional in plants. The choice of promoter will vary depending on the 
temporal and spatial requirements for expression, and also depending on the target 
species. Thus, expression of the nucleotide sequences of this invention in leaves, in ears, in 
inflorescences (e.g. spikes, panicles, cobs, etc.), in roots, and/or seedlings is preferred. In 
many cases, however, protection against more than one type of insect pest is sought, and 
thus expression in multiple tissues is desirable. Although many promoters from 
dicotyledons have been shown to be operational in monocotyledons and vice versa, ideally 
dicotyledonous promoters are selected for expression in dicotyledons, and 
monocotyledonous promoters for expression in monocotyledons. However, there is no 
restriction to the provenance of selected promoters; it is sufficient that they are operational 
in driving the expression of the nucleotide sequences in the desired cell. 

Preferred promoters that are expressed constitutively include promoters from genes 
encoding actin or ubiquitin and the CaMV 35S and 19S promoters. The nucleotide 
sequences of this invention can also be expressed under the regulation of promoters that 
are chemically regulated. This enables the insecticidal toxins to be synthesized only when 
the crop plants are treated with the inducing chemicals. Preferred technology for chemical 
induction of gene expression is detailed in the published application EP 0 332 104 (to Ciba- 
Geigy) and US patent 5,614,395. A preferred promoter for chemical induction is the 
tobacco PR-la promoter. 

A preferred category of promoters is that which is wound inducible. Numerous 
promoters have been described which are expressed at wound sites and also at the sites of 
phytopathogen infection. Ideally, such a promoter should only be active locally at the sites 
of infection, and in this way the insecticidal toxins only accumulate in cells which need to 
synthesize the insecticidal toxins to kill the invading insect pest. Preferred promoters of this 
kind include those described by Stanford et ai Mol. Gen. Genet. 215: 200-208 (1 989). Xu et 
ai Plant Molec. Biol. 22: 573-588 (1993), Logemann et ai Plant Cell 1: 151-158 (1989). 
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Rohrmeier & Lehle, Plant Molec. Biol. 22: 783-792 (1993), Firek et ai Plant Molec. Biol. 22: 
129-142(1993). and Warner et aL Plant J. 3: 191-201 (1993). 

Preferred tissue specific expression patterns include green tissue specific, root 
specific, stem specific, and flower specific. Promoters suitable for expression in green 
tissue include many which regulate genes involved in photosynthesis and many of these 
have been cloned from both monocotyledons and dicotyledons. A preferred promoter is the 
maize PEPC promoter from the phosphoenol carboxylase gene (Hudspeth & Grula, Plant 
Molec. Biol. 12: 579-589 (1989)). A preferred promoter for root specific expression is that 
described by de Framond (FEBS 290 : 103-106 (1991); EP 0 452 269 to Ciba-Geigy). A 
preferred stem specific promoter is that described in US patent 5,625,136 (to Ciba-Geigy) 
and which drives expression of the maize trpA gene. 

Especially preferred embodiments of the invention are transgenic plants expressing 
at least one of the nucleotide sequences of the invention in a root-preferred or root-specific 
fashion. Further preferred embodiments are transgenic plants expressing the nucleotide 
sequences in a wound-inducible or pathogen infection-inducible manner. 

In addition to the selection of a suitable promoter, constructions for expression of an 
insecticidal toxin in plants require an appropriate transcription terminator to be attached 
downstream of the heterologous nucleotide sequence. Several such terminators are 
available and known in the art (e.g. tm1 from CaMV, E9 from rbcS), Any available 
terminator known to function in plants can be used in the context of this invention. 

Numerous other sequences can be incorporated into expression cassettes 
described in this invention. These include sequences which have been shown to enhance 
expression such as intron sequences (e.g. from Adh1 and bronzel) and viral leader 
sequences (e.g. from TMV, MCMV and AMV). 

It may be preferable to target expression of the nucleotide sequences of the present 
invention to different cellular localizations in the plant. In some cases, localization in the 
cytosol may be desirable, whereas in other cases, localization in some subcellular organelle 
may be preferred. Subcellular localization of transgene encoded enzymes is undertaken 
using techniques well known in the art. Typically, the DNA encoding the target peptide from 
a known organelle-targeted gene product is manipulated and fused upstream of the 
nucleotide sequence. Many such target sequences are known for the chloroplast and their 
functioning in heterologous constructions has been shown. The expression of the 
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nucleotide sequences of the present invention is also targeted to the endoplasmic reticulum 
or to the vacuoles of the host cells. Techniques to achieve this are well-known in the art. 

Vectors suitable for plant transformation are described elsewhere in this 
specification. For Agrobacteriunrhmeii'iatedi transformation, binary vectors or vectors 
carrying at least one T-DNA border sequence are suitable, whereas for direct gene transfer 
any vector is suitable and linear DNA containing only the construction of interest may be 
preferred. In the case of direct gene transfer, transformation with a single DNA species or 
co-transformation can be used (Schocher et al. Biotechnology 4: 1093-1096 (1986)). For 
both direct gene transfer and Agrobacteriurrhmediated transfer, transformation is usually 
(but not necessarily) undertaken with a selectable marker which may provide resistance to 
an antibiotic (kanamycin, hygromycin or methotrexate) or a herbicide (basta). The choice of 
selectable marker is not, however, critical to the invention. 

In another preferred embodiment, a nucleotide sequence of the present invention is 
directly transfonmed into the plastid genome. A major advantage of plastid transformation is 
that plastids are generally capable of expressing bacterial genes without substantial 
modification, and plastids are capable of expressing multiple open reading frames under 
control of a single promoter. Plastid transformation technology is extensively described in 
U.S. Patent Nos. 5,451,513. 5,545,817, and 5,545,818, in PCT application no. WO 
95/16783, and in McBride etal. (1994) Proc. Natl. Acad. Sci. USA 91, 7301-7305. The basic 
technique for chloroplast transformation involves introducing regions of cloned plastid DNA 
flanking a selectable marker together with the gene of interest into a suitable target tissue, 
e.g., using biolistics or protoplast transformation (e.g., calcium chloride or PEG mediated 
transformation). The 1 to 1.5 kb flanking regions, termed targeting sequences, facilitate 
homologous recombination with the plastid genome and thus allow the replacement or 
modification of specific regions of the plastome. Initially, point mutations in the chloroplast 
16S rRNA and rps12 genes conferring resistance to spectinomycin and/or streptomycin are 
utilized as selectable markers for transformation (Svab, Z., Hajdukiewicz. P., and Maliga, P. 
(1990) Proc. Natl. Acad. Sci. USA 87. 8526-8530; Staub. J. M.. and Maliga. P. (1992) Plant 
Cell 4, 39-45). This resulted in stable homoplasmic transformants at a frequency of 
approximately one per 100 bombardments of target leaves. The presence of cloning sites 
between these markers allowed creation of a plastid targeting vector for introduction of 
foreign genes (Staub, J.M., and Maliga, P. (1993) EMBO J. 12. 601-606). Substantial 
increases in transformation frequency are obtained by replacement of the recessive rRNA 
or r-protein antibiotic resistance genes with a dominant selectable marker, the bacterial 
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aadA gene encoding the spectinomycin-detoxifying enzyme aminoglycoside-3'- 
adenyltransferase (Svab, Z., and Maliga, P. (1993) Proc. Natl. Acad. Sci. USA 90, 913-917). 
Previously, this mariner had been used successfully for high-frequency transformation of the 
plastid genome of the green alga Chlamydomonas reinhardtii (Goldschmidt-Clermont, M. 
(1991) Nucl. Acids Res. 19: 4083-4089). Other selectable markers useful for plastid 
transfomnation are known in the art and encompassed within the scope of the invention. 
Typically, approximately 15-20 cell division cycles following transformation are required to 
reach a homoplastidic state. Plastid expression, in which genes are inserted by homologous 
recombination into ail of the several thousand copies of the circular plastid genome present 
in each plant cell, takes advantage of the enormous copy number advantage over nuclear- 
expressed genes to permit expression levels that can readily exceed 10% of the total 
soluble plant protein. In a preferred embodiment, a nucleotide sequence of the present 
invention is inserted into a plastid targeting vector and transformed into the plastid genome 
of a desired plant host. Plants homoplastic for plastid genomes containing a nucleotide 
sequence of the present invention are obtained, and are preferentially capable of high 
expression of the nucleotide sequence. 

Formulation of Insecticidal Compositions 

The invention also includes compositions comprising at least one of the insecticidal 
toxins of the present invention. In order to effectively control insect pests such compositions 
preferably contain sufficient amounts of toxin. Such amounts vary depending on the crop to 
be protected, on the particular pest to be targeted, and on the environmental conditions, 
such as humidity, temperature or type of soil. In a preferred embodiment, compositions 
comprising the insecticidal toxins comprise host cells expressing the toxins without 
additional purification. In another preferred embodiment, the cells expressing the 
insecticidal toxins are lyophilized prior to their use as an insecticidal agent. In another 
embodiment, the insecticidal toxins are engineered to be secreted from the host cells. In 
cases where purification of the toxins from the host cells in which they are expressed is 
desired, various degrees of purification of the insecticidal toxins are reached. 

The present invention further embraces the preparation of compositions comprising 
at least one Insecticidal toxin of the present invention, which is homogeneously mixed with 
one or more compounds or groups of compounds described herein. The present invention 
also relates to methods of treating plants, which comprise application of the insecticidal 
toxins or compositions containing the insecticidal toxins, to plants. The insecticidal toxins 
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can be applied to the crop area in the form of compositions or plant to be treated, 
simultaneously or in succession, with further compounds. These compounds can be both 
fertilizers or micronutrient donors or other preparations that influence plant growth. They 
can also be selective herbicides, insecticides, fungicides, bactericides, nematicides, 
molluscicides or mixtures of several of these preparations, if desired together with further 
earners, surfactants or application-promoting adjuvants customarily employed in the art of 
formulation. Suitable carriers and adjuvants can be solid or liquid and correspond to the 
substances ordinahly employed in formulation technology, e.g. natural or regenerated 
mineral substances, solvents, dispersants, wetting agents, tackifiers, binders or fertilizers. 

A preferred method of applying insecticidal toxins of the present invention is by 
spraying to the environment hosting the insect pest like the soil, water, or foliage of plants. 
The number of applications and the rate of application depend on the type and intensity of 
infestation by the insect pest. The insecticidal toxins can also penetrate the plant through 
the roots via the soil (systemic action) by impregnating the locus of the plant with a liquid 
composition, or by applying the compounds in solid form to the soil. e.g. in granular form 
(soil application). The insecticidal toxins may also be applied to seeds (coating) by 
impregnating the seeds either with a liquid formulation containing insecticidal toxins, or 
coating them with a solid formulation. In special cases, further types of application are also 
possible, for example, selective treatment of the plant stems or buds. The insecticidal 
toxins can also be provided as bait located above or below the ground. 

The insecticidal toxins are used in unmodified form or. preferably, together with the 
adjuvants conventionally employed in the art of formulation, and are therefore formulated in 
known manner to emulsifiable concentrates, coatable pastes, directly sprayable or dilutable 
solutions, dilute emulsions, wettable powders, soluble powders, dusts, granulates, and also 
encapsulations, for example, in polymer substances. Like the nature of the compositions, 
the methods of application, such as spraying, atomizing, dusting, scattering or pouring, are 
chosen in accordance with the intended objectives and the prevailing circumstances. 

The formulations, compositions or preparations containing the insecticidal toxins 
and, where appropriate, a solid or liquid adjuvant, are prepared in known manner, for 
example by homogeneously mixing and/or grinding the insecticidal toxins with extenders, 
for example solvents, solid carriers and, where appropriate, surface-active compounds 
(surfactants). 

Suitable solvents include aromatic hydrocarbons, preferably the fractions having 8 to 
12 carbon atoms, for example, xylene mixtures or substituted naphthalenes, phthalates 
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such as dibutyl phthalate or dioctyl phthalate. aliphatic hydrocarbons such as cyclohexane 
or paraffins, alcohols and glycols and their ethers and esters, such as ethanol, ethylene 
glycol monomethyl or monoethyl ether, ketones such as cyclohexanone, strongly polar 
solvents such as N-methyl-2-pyrrolidone, dimethyl sulfoxide or dimethyl formamide, as well 
as epoxidized vegetable oils such as epoxidized coconut oil or soybean oil or water. 

The solid carriers used e.g. for dusts and dispersible powders, are normally natural 
mineral fillers such as calcite, talcum, kaolin, montmorillonite or attapulgite. In order to 
improve the physical properties it is also possible to add highly dispersed silicic acid or 
highly dispersed absorbent polymers. Suitable granulated adsorptive carriers are porous 
types, for example pumice, broken brick, sepiolite or bentonite; and suitable nonsorbent 
carriers are materials such as calcite or sand. In addition, a great number of pregranulated 
materials of inorganic or organic nature can be used, e.g. especially dolomite or pulverized 
plant residues. 

Suitable surface-active compounds are nonionic, cationic and/or anionic surfactants 
having good emulsifying, dispersing and wetting properties. The term "surfactants" will also 
be understood as comprising mixtures of surfactants. Suitable anionic surfactants can be 
both water-soluble soaps and water-soluble synthetic surface-active compounds. 

Suitable soaps are the alkali metal salts, alkaline earth metal salts or unsubstituted 
or substituted ammonium salts of higher fatty acids (chains of 10 to 22 carbon atoms), for 
example the sodium or potassium salts of oleic or stearic acid, or of natural fatty acid 
mixtures which can be obtained for example from coconut oil or tallow oil. The fatty acid 
methyltaurin salts may also be used. 

More frequently, however, so-called synthetic surfactants are used, especially fatty 
sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or alkylarylsulfonates. 

The fatty sulfonates or sulfates are usually in the form of alkali metal salts, alkaline 
earth metal salts or unsubstituted or substituted ammonium salts and have a 8 to 22 carbon 
alkyi radical which also includes the alkyi moiety of alkyi radicals, for example, the sodium 
or calcium salt of lignonsuifonic acid, of dodecylsulfate or of a mixture of fatty alcohol 
sulfates obtained from natural fatty acids. These compounds also comprise the salts of 
sulfuric acid esters and sulfonic acids of fatty alcohol/ethylene oxide adducts. The 
sulfonated benzimidazole derivatives preferably contain 2 sulfonic acid groups and one fatty 
acid radical containing 8 to 22 carbon atoms. Examples of alkylarylsulfonates are the 
sodium, calcium or triethanolamine salts of dodecylbenzenesulfonic acid, 
dibutylnapthalenesulfonic acid, or of a naphthalenesulfonic acid/formaldehyde 
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condensation product. Also suitable are corresponding phosphates, e.g. salts of the 
phosphoric acid ester of an adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide. 

Non-ionic surfactants are preferably polyglycol ether derivatives of aliphatic or 
cycloaliphatic alcohols, or saturated or unsaturated fatty acids and alkylphenols, said 
derivatives containing 3 to 30 glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) 
hydrocarbon moiety and 6 to 1 8 carbon atoms in the alkyi moiety of the alkylphenols. 

Further suitable non-ionic surfactants are the water-soluble adducts of polyethylene 
oxide with polypropylene glycol, ethylenediamine propylene glycol and alkylpolypropylene 
glycol containing 1 to 10 carbon atoms in the alkyI chain, which adducts contain 20 to 250 
ethylene glycol ether groups and 10 to 100 propylene glycol ether groups. These 
compounds usually contain 1 to 5 ethylene glycol units per propylene glycol unit. 

Representative examples of non-ionic surfactants are 
nonylphenolpolyethoxyethanols, castor oil polyglycol ethers, polypropylene/polyethylene 
oxide adducts, tributylphenoxypolyethoxyethanol, polyethylene glycol and 
octylphenoxyethoxyethanol. Fatty acid esters of polyoxyethylene sorbitan and 
polyoxyethylene sorbitan trioleate are also suitable non-ionic surfactants. 

Cationic surfactants are preferably quaternary ammonium salts which have, as N- 
substituent. at least one C8-C22 alkyI radical and, as further substituents, lower 
unsubstituted or halogenated alkyI, benzyl or lower hydroxyalkyi radicals. The salts are 
preferably in the form of halides, methylsulfates or ethylsulfates, e.g. 
stearyltrimethylammonium chloride or benzyldi(2-chloroethyl)ethylammonium bromide. 

The surfactants customarily employed in the art of formulation are described, for 
example, in "McCutcheon's Detergents and Emulsifiers Annual." MC Publishing Corp. 
Ringwood, New Jersey, 1979, and Sisely and Wood. "Encyclopedia of Surface Active 
Agents." Chemical Publishing Co., Inc. New York, 1980. 
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EXAMPLES 

The invention will be further described by reference to the following detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless otherwise specified. Standard recombinant DNA and 
molecular cloning techniques used here are well known in the art and are described by 
Ausubel (ed.), Current Protocols in Molecular Biology, John Wiley and Sons, Inc. (1994); T. 
Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor laboratory, Cold Spring Harbor, NY (1989); and by T.J. Silhavy, M.L. Berman, 
and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold 
Spring Harbor. NY (1984). 

A. Isolation Of Nucieotide Sequences Whose Expression Results In Toxins Active 
Against Lepidopteran Insects 

Example 1 : Construction of Cosmid Library from Photorhabdus luminescens 

Photorhabdus luminescens strain ATCC 29999 is grown in nutrient broth at 25°C for 
three days as described in the ATCC protocol for bioassay. The culture is grown tor 24 
hours for DNA isolation. Total DNA is isolated by treating freshly grown cells resuspended 
in 100 mM Tris pH 8, 10 mM EDTA with 2 mg/mi lysozyme for 30 minutes at 37°C. 
Proteinase K is added to a final concentration of 100 mg/ml, SDS is added to a final 
concentration of 0.5% SDS and the sample is incubated at AS'^C. After the solution 
becomes clear and viscous, the SDS concentration is raised to 1%, and 300 mM NaCI and 
an equal volume of phenol-chloroform-isoamyl alcohol are added, mixed gently for 5 
minutes and centrifuged at 3K. The phenol-chloroform-isoamyl alcohol extraction is 
repeated twice. The aqueous phase is mixed with 0.7 volumes isopropanol. and the sample 
is centrifuged. The pellet is washed 3 times with 70% ethanol and the nucleic acids are 
gently resuspended in 0.5X TE. 

The DNA is treated with 0.3 units of Sau3A per mg DNA at 37°C for 3.5 minutes in 
100 ml volume containing a total of 6 mg DNA. The reaction is then heated for 30 minutes 
at 65**C to inactivate the enzyme. Then 2 units of Calf Intestinal Alkaline Phosphatase are 
added and incubated for 30 minutes at 37**C. The sample is mixed with an equal volume of 
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phenol-chloroform-isoamyl alcohol and centrifuged. The aqueous phase is removed, 
precipitated with 0.7 volume isopropanol and centrifuged. The supernatant is transferred to 
a fresh tube, precipitated with ethanol, and the nucleic acids are resuspended in 0.5X TE at 
a concentration of 100 hg/ml. 

SuperCos cosmid vector (Stratagene, La Jolla, CA) is prepared as described by the 
supplier utilizing the BamHI cloning site. Prepared SuperCos at 100 hg/ml is ligated with the 
Sau3A digested P.luminescens DNA at a molar ratio of 2:1 in a 5 ml volume overnight at 
6°C. The ligation mixture is packaged using Gigapack XL III (Stratagene), as described by 
the supplier. Packaged phages are used to infect XL-1MR (Stratagene) cells as described 
by the supplier. The cosmid library is plated on L-agar with 50 mg/ml kanamycin and 
incubated 16 hours at SVC. 500 colonies are patched onto fresh L-kan plates at 50 
colonies per plate. From the other plates the cells are washed off with L broth and mixed 
with 20% glycerol and frozen at -80°C. 

Example 2: Insect Bioassays 

Plutella xylostella bioassays are performed by aliquoting of 50 |liI of the E. coli culture 
on the solid artificial Plutella xylostella diet (Biever and Boldt, Annals of Entomological 
Society of America, 1971; Shelton et al., J. Ent ScL 26:17). 4 ml of the diet is poured into 1 
oz, clear plastic cups (Bioserve product #9051). 5 neonate P. xylostella from a diet adapted 
lab colony are placed in each diet-containing cup and then covered with a white paper lid 
(Bioserve product #9049). 10 lan/ae are assayed per concentration. Trays of cups are 
placed in an incubator for 3 days at 72°F with a 14:10 (hours) light:dark cycle. Then, the 
number of live larvae in each cup is recorded. Bioassays for other insects are performed as 
described for Plutella xylostella, but using the diet required by the insect to be tested. 

The broth of P. luminescens undiluted and diluted 1:100 gives 100% mortality 
against P. xylostella. The broth of P. luminescens also gives 100% mortality against 
Diabrotica virgifera virgifera. Three clones with activity against P. xylostella and Heliothis 
virescens are obtained after screening 500 £ coli clones by insect bioassay. These cosmid 
clones are given the numbers pCIB9349, pCIB9350, and pCIB9351. 
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Example 3: Isolation of the NucI otid Sequence Responsible for insect C ntroi 
Activity from Clones pCIB9349, pCIB9350, and pCIB9351 

The three clones pCIB9349, pCIB9350 and pCIB9351 are found to be overlapping 
cosmids by restriction enzyme mapping. After digestion with Pad, clones pCIB9349 and 
pCIB9351 give two DNA fragments each, and pCIB9350 gives three DNA fragments. Each 
fragment is isolated and is self-ligated. The enzyme Pad does not cut the SuperCos vector; 
therefore, only fragments linked to it are re-isolated. The ligation mixtures are transformed 
into DH5a E. coli cells. Isolated transformed bacterial colonies are grown in L broth with 50 
^g/ml kanamycin, and plasmid DNA is isolated by using the alkaline miniprep protocol as 
described in Sambrook, et al. DNA is digested with Notl/PacI and two clones, pCIB9355 
and pCIB9356. are found by bioassay to still contain the insecticidal activity. Clone 
pCIB9355 is digested with Nott and a 17 kb and a 4 kb DNA fragment are generated. The 
17 kb fragment is isolated and ligated into Bluescript vector previously cut with Not\ and 
transformed into DH5a E. coli cells. The isolated transformed bacterial colonies are grown 
as described and plasmid DNA is isolated by the alkaline miniprep protocol. A clone 
containing the 17 kb insert is named pCIB9359 and tested by bioassay. The results are 
shown in Example 5. 3 |Lig of the 17 kb insert is isolated and treated with 0.3 unit of Sau3A 
per jig DNA for 4, 6, and 8 minutes at 37°C, heated at 75°C for 15 minutes. The samples 
are pooled and ligated into pUC19 previously cut with BamHI and treated with calf intestinal 
alkaline phosphatase. The ligation is transformed into DH5a cells and plated on L agar with 
Xgal/Amp as described in Sambrook et al. and grown overnight at 37°C. White colonies 
are picked and grown in L broth with 100 jig/ml and plasmid DNA is isolated as previously 
described. DNA is digested with EcoRI/Hindlll and novel restriction patterns are sequenced. 
Sequencing primers are ordered from Genosys Biotechnologies (Woodlands, TX). 
Sequencing is performed using the dideoxy chain-termination method. Sequencing is 
completed using Applied Biosystems Inc. model 377 automated DNA sequencer (Foster 
City. CA). Sequence is assembled using 3.0 from Gene Codes Corporation (Ann Arbor, 
Ml). 
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Exampi 4: Subcloning f th 9.7 kb EcoRI/Xbal Fragment Fr m pCIB9359 

pCIB9359 is digested with EcoRI and Xbal and the DNA is run on a 0.8% 
Seaplaque/TBE gel. The 9.7 kb fragment (SEQ ID NO:1) is isolated and ligated into pUC19 
previously digested with EcoRI and XbaL The ligation mixture is transformed into DH5a E. 
coli cells. Transformed bacteria are grown and plasmid DNA is isolated as previously 
described. The vector containing the 9.7 kb fragment in pUC19 is designated pCIB9359-7 
and bioassay results are shown in Example 5. 

Example 5: Bioassay Results for Cosmid Clones pCIB9359 and pCIB9359-7 



Cultures of E. coli strains 9359 and 9359-7 containing clones pCIB9359 and 
pCIB9359-7, respectively, are tested for insecticidal activity against the following insects in 
insect bioassays: 



Insects 


Clones 




PCIB9359 and pCIB9359-7 


Plutella xylostella (Diamondback Moth (DBM)) 


+++ 


Heliothis virescens (Tobacco Budworm (TBW)) 


++ 


Helicoverpa zea (Corn Earworm (CEW)) 


+++ 


Spodoptera exigua (Beet Armyworm (BAW)) 


+ 


Spodoptera frugiperda (Fall Amiyworm (PAW)) 


+ 


Trichoplusia n/ (Cabbage Looper (CL)) 


+++ 


Ostrinia nubilalis (European Corn Borer (ECB)) 


++ 


Manduca sexta (Tobacco Hornworm (THW) 


na 


Diabrotica virgifera (Western Corn Rootworm (WCR)) 


na 


Agrotis ipsilon (Black Cutworm (BCW)) 


na 



na = not active 

+ = significant growth inhibition 

++ = >40% mortality, but less than 100% 

+4"f = 1 00% mortality 
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The clones show insecticidal activity against P. xylostella, H. virescens, H. zea, T. ni, 
and O. nubilalis, and significant insect control activity against S. exigua and S. frugiperda. 

Example 6: Identification of Active Region of pCIB9359-7 By Subclontng 



Cultures of E. coli strains containing subclones of pCIB9359-7 are tested for 
insecticidal activity in insect bioassays against P. xylostella. 



Restriction 


Nucleotide Position Relative to 9.7 kb 


insecticidal Activity Against 


Fragment 


EcoRUXba\ fragment (SEQ ID NO:1) 


Plutella xylosteUa 




from pCIB9539-7 and Size in kb 




EcoRl/Xbal 


1 to 9712 


9.7 kb 


+++ 


EcoRV 


(-912) to 2309 


3.2 kb 


na 


Hindlll 


665 to 5438 


4.7 kb 


na 


Kpnl 


1441 to 8137 


6.9 kb 


na 


Sacl/Xbal 


2677 to 9712 


7.0 kb 


na 



na = not active 

+ = significant growth inhibition 

++ = >40% mortality, but less than 100% 

+++ = 1 00% mortality 

Example 7: Characterization of pCIB9359-7 Insect Control Activity By Titration 

Dilutions of a culture of E.coli strain 9359-7 containing pCIB9359-7 are tested for 
insecticidal activity in insect bioassays. Dilutions are prepared in a culture of E.co// XL-1 in a 
total volume of 100 |il and are transferred to diet cups with 5 insects per cup. The results 
show the percentage (%) of insect mortality. 
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111 r^ulture 


Px 


Hv 


Hz 


Tn 

m mm 


100 


100 


72 


48 


100 


50 


100 


84 


68 


92 


25 


100 


52 


32 


100 


12.5 


96 


52 


36 


68 


6.25 


88 


20 


4 


32 


0 


36 


20 


24 


0 



Px= P, xylostella, Hv= H. virescens, Hz= H. zea, Tn = T. ni 

Cultures of £. co// 9359-7 still show substantial insecticidal activity after dilution. 

Example 8: Stability of pCIB9359-7 Activity 

The stability of the toxins is tested after storage for 2 weeks at different 
temperatures and conditions. 300 ml of Luria broth containing 100 (p^g/ml ampicillin is 
inoculated with E. coli strain 9359-7 and grown overnight at 37^C. Samples are placed in 
sterile 15 ml screw cap tubes and stored at 22°C and 4*'C. Another sample is centrifuged; 
the supernatant is removed, freeze dried and stored at 22''C. The samples are stored under 
these conditions for 2 weeks and then a bioassay is conducted against P. xylostella. The 
freeze dried material is resuspended in the same volume as before. All samples are 



resuspended by vortexing. 


Conditions 


Results 


22°C (2 weeks) 


+++ 


4°C (2 weeks) 


+++ 


Freeze Dried (2 weeks) 


+++ 



na = not active; + = significant growth inhibition; ++ = >40% mortality, but less than 100%; 
+++ = 1 00% mortality 
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This demonstrates that the toxins retain their activity for at least two weeks at 22*^0, 
4°C, and freeze-dried, and are therefore very stable. 

Example 9: Size Fraction of pCIB9359-7 Activity 

The approximate sizes of the insecticidal toxins are determined. P. luminescens 
cosmid clones pCIB9359-7 and pUC19 in E. colt host DH5a are grown in media consisting 
of 50% Terrific broth and 50% Luria broth, supplemented with 50 |xg/ml ampicillin. Cultures 
(three tubes of each strain) are inoculated into 3 ml of the above media in culture tubes and 
incubated on a roller wheel overnight at 37*^C, Cultures of each strain are combined and 
sonicated using a Branson Model 450 Sonicator, micro tip, for approximately six 10 second 
cycles with cooling on ice between cycles. The sonicates are centrifuged in a Sorvall SS34 
rotor at 6000 RPM for 10 minutes. The resultant supernatants are filtered through a 0.2 p 
filter. The 3 ml fractions of the filtrates are applied to Bio-Rad Econo-Pac 10DG columns 
that have been previously equilibrated with 10 ml of 50mM NaCI, 25 mM Tris base, pH 7.0. 
The flow through collected during sample loading is discarded. The samples are 
fractionated with two subsequent additions of 4 ml each of the NaCI - Tris equilibration 
buffer. The two four ml fractions are saved for testing. The first fraction contains all 
material above about 6,000 mol. wt; the second fraction contains material smaller than 
6,000 mol. wt. A sample of the whole culture broth, the sonicate, and the filtered 
supernatant on the sonicate are tested along with the three fractions from the 10DG column 
for activity on P. xylostella neonates in bioassays. 

The culture, the sonicate, and the filtered supernatant of the sonicate, and the first 
column fraction from the 9359-7 sample are highly active on P. xylostella. The second 
column fraction from 9359-7 is slightly active (some stunting only). No activity is found in 
the third fraction from 9359-7, The sample from DH5-pUC19 does not have any activity. 
This indicates that the molecular weights of the toxins are above 6,000. 

Example 10: Heat Inactivitation of pCIB9359-7 Activity 

The heat stability of the toxins is determined. Overnight cultures of the E. co// strain 
pCIB9359-7 are grown in a 50:50 mixture of Luria broth and Terrific broth. Cultures are 
grown at 37°C in culture tubes on a tube roller. A one ml sample of the culture is placed in 



5DOCID: <WO 9942589A2_L> 



wo 99/42589 



PCT/EP99/010I5 



-36- 

a 1.5 ml eppendorf tube and placed in a boiling water bath. The sample is removed after 
five minutes and allowed to cool to room temperature. This sample along with an untreated 
portion of the culture is assayed on P. xylostella, SOpI of sample of sample is spread on 
diet, allowed to dry and neonate larvae P. xylostella applied to the surface. The assay is 
incubated for 5 days at room temperature. 

The untreated sample causes 1 00% mortality. The heat treated sample and a diet 
alone control do not cause any observable mortality, showing the toxins are heat sensitive. 

Example 11: Leaf Dip Bioassay of pCIB9359-7 

Insecticidal activity of the toxins is tested in a leaf dip bioassay. Six leaves 
approximately 2cm in diameter each are cut from seedlings of tumip and placed in a 1oz. 
plastic cup (Jet Plastica) with 4ml-5ml of the resuspended toxin, covered tightly, and shaken 
until thoroughly wetted. The treated leaves are placed in 50mm petri dishes (Gelman 
Sciences) on absorbent pads moistened with 300\x\ of water. The dish covers are left open 
until the leaf surface appears dry and then placed on tightly so that the leaves do not dry 
out. 

Ten neonate P. xylostella larvae are placed in each petri dish arena. Also, a 
treatment of 0.1% Bond spreader/sticker with no toxin is set up as a control. The arenas 
are monitored daily for signs of drying leaves, and water is added or leaves replaced if 
necessary. After 3 days the leaves and arenas are examined under a dissecting 
microscope, and the number of live larvae in each arena is recorded. 

100% mortality is found for 9359-7 and none in the no-toxin control, showing that the 
toxins are also insecticidal in a leaf dip assay. 

B. Isolation Of Nucleic Acid Sequences Whose Expression Results In Toxins Active 
Against Lepidooteran and Coleopteran Insects 

Example 12: Total DNA Isolation from Photorhabdus luminescens 

Photorhabdus luminescens strain ATCC 29999 is grown 14-18 hours in L broth. 
Total DNA is isolated from 1.5 mis of culture resuspended in 0.5% SDS, lOO^g/ml 
proteinase K. TE to a final volume of 600 nl. After a 1 hour incubation at 37^C, 100^1 5M 
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NaCI and 80^.1 CTAB/NaCI are added and the culture is incubated at 65°C for 10 minutes. 
An equal volume of chloroform is added; the culture is mixed gently and spun. The 
aqueous phase is extracted once with phenol and once with chloroform. The nucleic acids 
are treated with 10 |xg RNase A for 30 minutes at room temperature. The aqueous phase is 
mixed with 0.6 volumes isopropanol and the sample is centrifuged. The pellet is washed 
once with 70% ethanol and the nucleic acids are gently resuspended in 100-200ul TE. 

Example 13: PCR Amplification of Probes 

Two probes are PCR amplified from Photorhabdus luminescens strain ATCC 29999 
genomic DNA using oligos 5'-ACACAGCAGGTTCGTCAG-3' (SEQ ID NO:7) and 5'- 
GGCAGAAGCACTCAACTC-3' (SEQ ID NO:8) to amplify probe #1 and oligos 5'- 
ATTGATAGCACGCGGCGACC-3' (SEQ ID NO:9) and 5'- 

TTGTAACGTGGAGCCGAACTGG-3' (SEQ ID NO:10) to amplify probe #2. The oligos are 
ordered from Genosys Biotechnologies, Inc. (Texas). Approximately 10-50 ng of genomic 
DNA is used as the template. 0.8|liM of oligos, 200^M of dNTPs, IX Taq DNA Polymerase 
buffer and 2.5 units of Taq DNA Polymerase are included in the reaction. The reaction 
conditions are as follows: 
94''C - 1 minute 

94°C - 30 seconds / 60°C - 30 seconds / 72°C - 30 seconds (25 cycles) 
72''C - 5 minutes 
A^'C ' indefinite soak 

The reactions are preferably carried out in a PCR System 9600 (Perkin Elmer) 
thermocycler. 

Example 14: Probing a Photorhabdus luminescens Library 

600 clones from the P. luminescens cosmid library described in Example 1 are 
patched to L-amp plates in duplicate. The colonies are grown overnight then moved to 4°C. 
The colonies are lifted onto Colony/Plaque Screen Hybridization Transfer Membranes 
(Biotechnology Systems NEN Research Products). The membranes are incubated 2-3 
minutes in 0.75ml 0.5N NaOH twice. The membranes are then incubated 2-3 minutes in 
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0.75ml 1.0M Tris-HCI, pH 7.5 twice. The membranes are allowed to dry at room 
temperature. 

Probe #1 and probe #2 described in Example 13 are labeled using the DECAprime II 
Kit as described by the manufacturer (Ambion cat# 1455). Unincorporated nucleotides are 
removed from the labeled probes using Quick Spin Columns as described by the 
manufacturer (Boehringer Mannheim cat #1273973). The labeled probes are measured for 
incorporated radioactivity and the specific activity is 10,000,000 cpm. Membranes are 
prewetted with 2X SSC and hybridized with the probes for 12-16 hours at 65°C. One set of 
colony lifts is hybridized with probe #1 and the other set is hybridized with probe #2. The 
membranes are washed with wash CHURCH solutions 1 and 2 (Church and Gilbert, Proc. 
Natl. Acad. ScL USA 81:1991-1995 (1984)) and exposed to Kodak film. 

Twenty one clones are identified that hybridize to probe #1 and seven clones are 
identified that hybridize to probe #2. The gene in the clones isolated with probe #1 is 
named hphi and the gene in the clones isolated with probe #2 is named hph2. 

Example 15: Insect Bioassays 

The clones identified in Example 14 are tested for insecticidal activity against the 
following insects in insect bioassays: Diabrotica virgifera virgifera (Western Corn Rootworm 
(WCR)). Diabrotica undecimpunctata howardi (Southern Corn Rootworm (SCR)), Ostrinia 
nubilalis (European Corn Borer (ECB)). and Plutella xylostella (Diamondback Moth (DBM)). 

Diabrotica virgifera virgifera (Western Corn Rootworm) and Diabrotica 
undecimpunctata howardi {Southern Corn Rootworm) assays are performed using a diet 
incorporation method. 500|xl of an overnight culture of the cosmid library in XL-1 Blue MR 
cells (Stratagene) is sonicated and then mixed with 500|il of diet. Once the diet solidifies, it 
is dispensed in a petri dish and 20 lan/ae are introduced over the diet. Trays of dishes are 
placed in an incubator for 3-5 days, and percent mortality is recorded at the end of the 
assay period. 

Ostrinia nubilalis (European Corn Borer) and Plutella xylostella (Diamondback Moth) 
assays are performed by a surface treatment method. The diet is poured in the petri dish 
and allowed it to solidify. The E. co// culture of 200 -300|xl volume is dispensed over the diet 
surface and entire diet surface is covered to spread the culture with the help of bacterial 
loop. Once the surface is dry. 10 larvae are introduced over the diet surface. Trays of 
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dishes are placed in an incubator for 3-5 days. The assay with European Corn Borer is 
incubated at 30°C in complete darkness; the assay with Diamondback Moth is incubated at 
72°F with a 14:10 (hours) lightrdark cycle. Percent mortality is recorded at the end ot the 
assay period. 

Cosmids containing hph2 are identified with a range of activities, including: WCR 
only; SCR only; WCR and SCR; SCR and ECB; WCR. SCR, and ECB; or WCR. SCR. ECB. 
and DBM activity. 

In addition to probing the P. luminescens cosmid library with DNA probes. 600 
clones are screened by Western Corn Rootworm bioassay. A clone is identified with activity 
against Western Com Rootworm. This clone hybridizes with probe #2. 

From these bioassays, cosmid 514, having activity against WCR, SCR. ECB, and 
DBM, is selected for sequencing. 

Example 16: Sequencing of Cosmid 514 

Cosmid 514 is sequenced using dye terminator chemistry on an ABI 377 instrument. 
The nucleotide sequence of cosmid 514 is set forth as SEQ ID NO:11. Cosmid 514 is 
designated pNOV2400 and deposited with the NRRL in E. coli DH5a and assigned 
accession no. B-30077. 

Example 17: Subcloning Insecticidal Regions of Cosmid 514 
514a 

An 9011 base pair fragment within cosmid 514 (SEQ ID NO:11) is removed by 
digesting the cosmid with the restriction endonuclease Spel (New England Biolabs 
(Massachusetts), and ligating (T4 DNA Ligase. NEB) the remainder of 514. Subclone 514a 
consists of cosmid 514 DNA from base pairs 1-2157 ligated to base pairs 1 1 ,169-37,948. 

H202/DET34 

hph2 and orfZ (SEQ ID NO:11, base pairs 23,768-35,838) are cloned into pET34b 
(Novagen, Wisconsin). Restriction sites are engineered on both ends of each gene to 
facilitate cloning. PGR is used to add the restriction sites to the genes. A BamH\ site is on 
the 5' end of hph2 immediately upstream of the ATG of hph2, and a Sad site is added to 
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the 3* end of hph2 immediately following the DNA triplet encoding the stop codon. A 
guanidine is added between the BamHI site and the start codon of hph2 to put the hph2 
gene in frame with the Cellulose Binding Domain tag in pET34b. Orf2 has a Sad site 
upstream of the 56 base pairs between the stop codon of hph2 and the start codon of orf2. 
The 56 base pairs are included in the hph2'Orf2 construct to mimic their setup in the 514 
cosmid. Orf2 has an Xho\ site on the 3' end immediately following the stop codon. The 
oligos used to add the restriction sites to hph2 and orf2 are as follows: 

hph2-A 5'-CGGGATCCGATQATTTTAAAAGG-3' (SEQ ID NO: 15) 
hph2-B 5'-GCGCCATTGATTTGAG-3' (SEQ ID NO:1 6) 
hph2-C 5'-CATTAGAGGTCGAACGTAC-3* (SEQ ID NO:17) 
hph2-D 5'-GAGCGAGCTCTTACTTAATGGTGTAG-3* (SEQ ID NO: 18) 
orf2-A3 5'-CAGCGAGCTCCATGCAGAATTCAGAGAC-3' (SEQ ID NO:19) 
orf2-B 5'-GGCAATGGCAGCGATAAG-3' (SEQ ID NO:20) 
orf2-C 5'-CATTAACGCAGGAAGAGC-3' (SEQ ID NO:21 ) 
orf2-D 5'-GACCTCGAGTTACACGAGCGCGTCAG-3' (SEQ ID NO:22) 
The BamH\-SacA 7583 base pair fragment, corresponding to the hph2 gene, and the 
Sad'Xhol 4502 base pair orf2 (including the 56 base pairs between hph2 and orf2 open 
reading frames), corresponding to orf2, are ligated with SamHI-X/iol-digested vector DNA 
pET34b. 

Orf5/DBS (Non-Bamm 
The 5325 base pair NoA-BarriHl fragment of cosmid 514 is cloned into pBS-SK using 
Ami'Noti (415 bp) and ean7HI-/4f/III (2530 bp) fragments of pBS-SK. 

05-H2-02 

The 12,031 base pair BamH\'Xho\ fragment of H202/pET34 is cloned into the 8220 
base pair Xho\'BamH\ fragment of Orf5/pBS. 

O51011H2O2 

A 7298 base pair BamH\-Mlul fragment from subclone 514a is ligated (T4 DNA 
Ligase, NEB) with 9588 bp Mlu\-Xhol and 8220 bp Xhol-Bamhl fragments of subclone 05- 
H2-02. The resulting - 22 kb subclone O51011H2O2, which has activity against WCR and 
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ECB, is designated pNOVIOOl and deposited with the NRRL in E. coli DH5a and assigned 
accession no. B-30078. 

AKH202 

A 12.074 base pair BamH\-Avr\\ fragment of H202ypET34 is ligated (T4 DNA Ugase, 
NEB) into pK184 Nhel-BamHl fragment (2228 bp), generating a clone containing hph2 and 
orf2 in a pi 5a origin of replication, kanamycin-resistant vector. 

Example 18: Insecticidal Activity of Subclones 

Bioassays as described above are performed with E. coli cultures that express the 
above subclones, both singly and in combination. Coexpressing AKH202 and Orf5/pBS in 
E. coli, for example in DH5a or HB101, is found to give insecticidal activity against the 
Lepidopterans Plutella xylostella (DIamondback Moth), Ostrinia nubilalis (European Corn 
Borer), and Manduca sexta (Tobacco Hornworm), as well as against the Coleopterans 
Diabrotica virgifera virglfera (Westem Corn Rootworm), Diabrotica undecimpunctata 
howardi (Southern Corn Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle), Thus, coexpression of hph2 (SEQ ID NO:11. base pairs 23.768-31,336). orf2 
(SEQ ID NO:11, base pairs 31,393-35,838), and orfS (SEQ ID NO:11. base pairs 15,171- 
1 8,035) is sufficient to control these insects. In addition, expression of each of these three 
ORFs on separate plasmids gives insect control activity, demonstrating that they do not 
have to be genetically linked to be active, so long as all three gene products are present. 

C, Expression of the Nucleic Acid Sequences of the Invention in Heterologous 

Microbial Hosts 

Microorganisms which are suitable for the heterologous expression of the nucleotide 
sequences of the invention are all microorganisms which are capable of colonizing plants or 
the rhizosphere. As such they will be brought into contact with insect pests. These include 
gram-negative microorganisms such as Pseudomonas, Enterobacter and Serratia, the 
gram-positive microorganism Bacillus and the fungi Trichoderma, Gliocladium, and 
Saccharomyces cerevisiae. Particularly preferred heterologous hosts are Pseudomonas 
fluorescens, Pseudomonas putida, Pseudomonas cepacia, Pseudomonas aureofaciens, 
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Pseudomonas aurantiaca, Enterobacter cloacae, Serratia marscesens, Bacillus subtilis, 
Bacillus cereus, Trichoderma viride, Trichoderma harzianum, Gliocladium virens, and 
Saccharomyces cerevisiae. 

Example 19: Expression of the Nucleotide Sequences in E. coli and Other Gram- 
Negative Bacteria 

Many genes have been expressed in gram-negative bacteria in a heterologous 
manner. Expression vector pKK223-3 (Pharmacia catalogue # 27-4935-01) allows 
expression in E. coll. This vector has a strong tac promoter (Brosius, J. et aL, Proc. NatL 
Acad. ScL USA 81) regulated by the lac repressor and induced by IPTG. A number of other 
expression systems have been developed for use in E. coli. The thermoinducible 
expression vector pPL (Phamiacia #27-4946-01) uses a tightly regulated bacteriophage X 
promoter which allows for high level expression of proteins. The lac promoter provides 
another means of expression but the promoter is not expressed at such high levels as the 
tac promoter. With the addition of broad host range replicons to some of these expression 
system vectors, expression of the nucleotide sequence in closely related gram negative- 
bacteria such as Pseudomonas, Enterobacter, Serratia and Erwinia is possible. For 
example. pLRKD211 (Kaiser & Kroos, Proc. Natl. Acad. Sci. USA 81: 5816-5820 (1984)) 
contains the broad host range replicon oh 7 which allows replication in many gram-negative 
bacteria. 

In E. coli, induction by IPTG is required for expression of the tac {i.e. trp-lac) 
promoter. When this same promoter (e.gf. on wide-host range plasmid pLRKD211) is 
introduced into Pseudomonas it is constitutively active without induction by IPTG. This trp- 
lac promoter can be placed in front of any gene or operon of interest for expression in 
Pseudomonas or any other closely related bacterium for the purposes of the constitutive 
expression of such a gene. Thus, a nucleotide sequence whose expression results in an 
insecticidal toxin can therefore be placed behind a strong constitutive promoter, transferred 
to a bacterium which has plant or rhizosphere colonizing properties turning this organism to 
an insecticidal agent. Other possible promoters can be used for the constitutive expression 
of the nucleotide sequence in gram-negative bacteria. These include, for example, the 
promoter from the Pseudomonas regulatory genes gafA and lemA (WO 94/01561) and the 
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Pseudomonas savastanoi \AA operon promoter (Gaffney et aL, J. Bactehol. 772; 5593-5601 
(1990). 

Example 20: Expression of the Nucleotide Sequences in Gram-Positive Bacteria 

Heterologous expression of the nucleotides sequence in gram-positive bacteria is 
another means of producing the insecticidal toxins. Expression systems for Bacillus and 
Streptomyces are the best characterized. The promoter for the erythromycin resistance 
gene (ermR) from Streptococcus pneumoniae has been shown to be active in gram-positive 
aerobes and anaerobes and also in E.coli (Trieu-Cuot et al., NucI Acids Res 18: 3660 
(1990)). A further antibiotic resistance promoter from the thiostreptone gene has been used 
in Streptomyces cloning vectors (Bibb, Mo! Gen Genet 199 : 26-36 (1985)). The shuttle 
vector pHT3101 is also appropriate for expression in Bacillus (Lereclus, FEMS Microbiol 
Lett 60: 211-218 (1989)). A significant advantage of this approach is that many gram- 
positive bacteria produce spores which can be used in formulations that produce 
insecticidal agents with a longer shelf life. Bacillus and Streptomyces species are 
aggressive colonizers of soils 

Example 21 : Expression of the Nucleotide Sequences in Fungi 

Trichoderma harzianum and Gliocladium virens have been shown to provide varying 
levels of biocontrol in the field (US 5,165,928 and US 4,996.157. both to Cornell Research 
Foundation). A nucleotide sequence whose expression results in an insecticidal toxin could 
be expressed in such a fungus. This could be accomplished by a number of ways which are 
well known in the art. One is protoplast-mediated transformation of the fungus by PEG or 
electroporation-mediated techniques. Alternatively, particle bombardment can be used to 
transform protoplasts or other fungal cells with the ability to develop into regenerated 
mature structures. The vector pAN7-1. originally developed for Aspergillus transformation 
and now used widely for fungal transformation (Curragh et ai, Mycol. Res. 97(3): 313-317 
(1992;; Tooley e/a/., Curr Genet, 21: 55-60 (1992); Punt ef a/.. Gene 56: 117-124 (1987)) 
is engineered to contain the nucleotide sequence. This plasmid contains the E. coil the 
hygromycin B resistance gene flanked by the Aspergillus nidulans gpd promoter and the 
frpC terminator (Punt e/a/., Gene 56: 117-124 (1987)). 
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In a preferred embodiment, the nucleic acid sequences of the invention are 
expressed in the yeast Saccharomyces cerevisiae. Each of the three ORF's of SEQ ID 
NO:11 (hph2, orf2 and orf5), which together confer insecticidal activity, are cloned into 
individual vectors with the GAL1 inducible promoter and the CYC1 terminator. Each vector 
has ampicillin resistance and the 2 micron replicon. The vectors differ in their yeast growth 
markers. hph2 is cloned into p424 (TRP1. ATCC 87329), orf2 into p423 (HISS, ATCC 
87327), and orfS into p425 (LEU2, ATCC 87331). The three constructs are transformed 
into S. cerevisiae independently and together. The three ORFs are expressed together and 
tested for protein expression and insecticidal activity. 

D. Expression of the Nucleotide Sequences in Transgenic Plants 

The nucleic acid sequences described in this application can be incorporated into 
plant cells using conventional recombinant DNA technology. Generally, this involves 
inserting a coding sequence of the invention into an expression system to which the coding 
sequence is heterologous (i.e., not normally present) using standard cloning procedures 
known in the art. The vector contains the necessary elements for the transcription and 
translation of the inserted protein-coding sequences. A large number of vector systems 
known in the art can be used, such as plasmids, bacteriophage viruses and other modified 
viruses. Suitable vectors include, but are not limited to, viral vectors such as lambda vector 
systems Xgtll . XgtlO and Charon 4; plasmid vectors such as pBI121. pBR322. pACYC177. 
pACYC184, pAR series, pKK223-3, pUC8, pUC9, pUC18. pUCIQ, pLG339, pRK290, 
pKC37, pKC101. pCDNAII; and other similar systems. The components of the expression 
system may also be modified to increase expression. For example, truncated sequences, 
nucleotide substitutions or other modifications may be employed. The expression systems 
described herein can be used to transform virtually any crop plant cell under suitable 
conditions. Transformed cells can be regenerated into whole plants such that the 
nucleotide sequence of the invention confer insect resistance to the transgenic plants. 

Example 22: Modification of Coding Sequences and Adjacent Sequences 

The nucleotide sequences described in this application can be modified for 
expression in transgenic plant hosts. A host plant expressing the nucleotide sequences and 
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which produces the insecticidal toxins in its cells has enhanced resistance to insect attack 
and is thus better equipped to withstand crop losses associated with such attack. 

The transgenic expression in plants of genes derived from microbial sources may 
require the modification of those genes to achieve and optimize their expression in plants. 
In particular, bacterial ORFs which encode separate enzymes but which are encoded by the 
same transcript in the native microbe are best expressed in plants on separate transcripts. 
To achieve this, each microbial ORF is isolated individually and cloned within a cassette 
which provides a plant promoter sequence at the 5' end of the ORF and a plant 
transcriptional terminator at the 3' end of the ORF. The isolated ORF sequence preferably 
includes the initiating ATG codon and the terminating STOP codon but may include 
additional sequence beyond the initiating ATG and the STOP codon. In addition, the ORF 
may be truncated, but still retain the required activity; for particularly long ORFs, truncated 
versions which retain activity may be preferable for expression in transgenic organisms. By 
"plant promoter"' and "plant transcriptional terminator" it is intended to mean promoters and 
transcriptional terminators which operate within plant cells. This includes promoters and 
transcription terminators which may be derived from non-plant sources such as viruses (an 
example is the Cauliflower Mosaic Virus). 

In some cases, modification to the ORF coding sequences and adjacent sequence 
is not required. It is sufficient to isolate a fragment containing the ORF of interest and to 
insert it downstream of a plant promoter. For example, Gaffney et aL (Science 261 : 754- 
756 (1993)) have expressed the Pseudomonas nahG gene in transgenic plants under the 
control of the CaMV 358 promoter and the CaMV tml terminator successfully without 
modification of the coding sequence and with x bp of the Pseudomonas gene upstream of 
the ATG still attached, and y bp downstream of the STOP codon still attached to the nahG 
ORF. Preferably as little adjacent microbial sequence should be left attached upstream of 
the ATG and downstream of the STOP codon. In practice, such construction may depend 
on the availability of restriction sites. 

In other cases, the expression of genes derived from microbial sources may provide 
problems in expression. These problems have been well characterized in the art and are 
particularly common with genes derived from certain sources such as Bacillus. These 
problems may apply to the nucleotide sequence of this invention and the modification of 
these genes can be undertaken using techniques now well known in the art. The following 
problems may be encountered: 
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1 . Codon Usage. 

The preferred codon usage in plants differs from the preferred codon usage in 
certain microorganisms. Comparison of the usage of codons within a cloned microbial ORF 
to usage in plant genes (and in particular genes from the target plant) will enable an 
identification of the codons within the ORF which should preferably be changed. Typically 
plant evolution has tended towards a strong preference of the nucleotides C and G in the 
third base position of monocotyledons, whereas dicotyledons often use the nucleotides A or 
T at this position. By modifying a gene to incorporate preferred codon usage for a particular 
target transgenic species, many of the problems described below for GC/AT content and 
illegitimate splicing will be overcome. 

2. GC/AT Content. 

Plant genes typically have a GC content of more than 35%. ORF sequences which 
are rich in A and T nucleotides can cause several problems in plants. Firstly, motifs of 
ATTTA are believed to cause destabilization of messages and are found at the 3' end of 
many short-lived mRNAs. Secondly, the occurrence of polyadenylatton signals such as 
AATAAA at inappropriate positions within the message is believed to cause premature 
truncation of transcription. In addition, monocotyledons may recognize AT-rich sequences 
as splice sites (see below). 

3. Sequences Adjacent to the Initiating Methionine. 

Plants differ from microorganisms in that their messages do not possess a defined 
ribosome binding site. Rather, it is believed that ribosomes attach to the 5' end of the 
message and scan for the first available ATG at which to start translation. Nevertheless, it 
is believed that there is a preference for certain nucleotides adjacent to the ATG and that 
expression of microbial genes can be enhanced by the inclusion of a eukaryotic consensus 
translation initiator at the ATG. Clontech (1993/1994 catalog, page 210, incorporated 
herein by reference) have suggested one sequence as a consensus translation initiator for 
the expression of the E. coli uidA gene in plants. Further, Joshi (NAR 15: 6643-6653 
(1 987), incorporated herein by reference) has compared many plant sequences adjacent to 
the ATG and suggests another consensus sequence. In situations where difficulties are 
encountered in the expression of microbial ORFs in plants, inclusion of one of these 
sequences at the initiating ATG may improve translation. In such cases the last three 
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nucleotides of the consensus may not be appropriate for inclusion in the modified sequence 
due to their modification of the second AA residue. Preferred sequences adjacent to the 
initiating methionine may differ between different plant species. A survey of 14 maize 
genes located in the GenBank database provided the following results: 



Position Before the Initiating ATG in 14 Maize Genes : 
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This analysis can be done for the desired plant species into which the nucleotide sequence 
is being incorporated, and the sequence adjacent to the ATG modified to incorporate the 
preferred nucleotides. 

4. Removal of Illegitimate Splice Sites. 

Genes cloned from non-plant sources and not optimized for expression in plants 
may also contain motifs which may be recognized in plants as 5' or 3' splice sites, and be 
cleaved, thus generating truncated or deleted messages. These sites can be removed 
using the techniques well known in the art. 

Techniques for the modification of coding sequences and adjacent sequences are 
well known in the art. In cases where the initial expression of a microbial ORF is low and it 
is deemed appropriate to make alterations to the sequence as described above, then the 
construction of synthetic genes can be accomplished according to methods well known in 
the art. These are, for example, described in the published patent disclosures EP 0 385 
962 (to Monsanto), EP 0 359 472 (to Lubrizol) and WO 93/07278 (to Ciba-Geigy), all of 
which are incorporated herein by reference. In most cases it is preferable to assay the 
expression of gene constructions using transient assay protocols (which are well known in 
the art) prior to their transfer to transgenic plants. 
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Example 23: Construction of Plant Expr ssion Cass tt s 

Coding sequences intended for expression in transgenic plants are first assembled in 
expression cassettes behind a suitable promoter expressible in plants. The expression 
cassettes may also comprise any further sequences required or selected for the expression 
of the transgene. Such sequences include, but are not restricted to, transcription 
terminators, extraneous sequences to enhance expression such as introns, vital sequences, 
and sequences intended for the targeting of the gene product to specific organelles and cell 
compartments. These expression cassettes can then be easily transferred to the plant 
transformation vectors described below. The following is a description of various 
components of typical expression cassettes. 

1. Promoters 

The selection of the promoter used in expression cassettes will determine the spatial 
and temporal expression pattern of the transgene in the transgenic plant. Selected 
promoters will express transgenes in specific cell types (such as leaf epidermal cells, 
mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or flowers, 
for example) and the selection will reflect the desired location of accumulation of the gene 
product. Alternatively, the selected promoter may drive expression of the gene under 
various inducing conditions. Promoters vary in their strength, i.e., ability to promote 
transcription. Depending upon the host cell system utilized, any one of a number of suitable 
promoters can be used, including the gene's native promoter. The following are non- 
limiting examples of promoters that may be used in expression cassettes. 

a. Constitutive Expression, the Ubiquitin Promoter: 

Ubiquitin is a gene product known to accumulate in many cell types and its promoter 
has been cloned from several species for use in transgenic plants {e.g. sunflower - Binet et 
ai Plant Science 79: 87-94 (1991); maize - Christensen et aL Plant Molec. Biol. 12: 619- 
632 (1989); and Arabidopsis - Norris et aL, Plant Moi Biol. 21 :895-906 (1993)). The maize 
ubiquitin promoter has been developed in transgenic monocot systems and its sequence 
and vectors constructed for monocot transformation are disclosed in the patent publication 
EP 0 342 926 (to Lubrizol) which is herein incorporated by reference. Taylor et al. (Plant 
Cell Rep. 12: 491-495 (1993)) describe a vector (pAHC25) that comprises the maize 
ubiquitin promoter and first intron and its high activity in cell suspensions of numerous 
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monocotyledons when introduced via microprojectile bombardment. The Arabidopsis 
ubiquitin promoter is ideal for use with the nucleotide sequences of the present invention. 
The ubiquitin promoter is suitable for gene expression in transgenic plants, both 
monocotyledons and dicotyledons. Suitable vectors are derivatives of pAHC25 or any of 
the transformation vectors described in this application, modified by the introduction of the 
appropriate ubiquitin promoter and/or intron sequences. 

b. Constitutive Expression, the CaMV 35S Promoter: 

Construction of the plasmid pCGN1761 is described in the published patent 
application EP 0 392 225 (Example 23). which is hereby incorporated by reference. 
pCGN1761 contains the "double" CaMV 35S promoter and the fm/ transcriptional terminator 
with a unique EcoRI site between the promoter and the terminator and has a pUC-type 
backbone. A derivative of pCGN1761 is constructed which has a modified polylinker which 
includes NotI and Xhol sites in addition to the existing EcoRI site. This derivative is 
designated pCGN1761ENX. pCGN1761ENX is useful for the cloning of cDNA sequences 
or coding sequences (including microbial ORF sequences) within its polylinker for the 
purpose of their expression under the control of the 35S promoter in transgenic plants. The 
entire 35S promoter-coding sequence-fm/ terminator cassette of such a construction can be 
excised by Hindlll, SphI, Sail, and Xbal sites 5' to the promoter and Xbal, BamHI and Bgll 
sites 3' to the terminator for transfer to transformation vectors such as those described 
below. Furthermore, the double 35S promoter fragment can be removed by 5' excision with 
Hindlll, SphI, Sail, Xbal, or PstI, and 3' excision with any of the polylinker restriction sites 
{EcoRI, NotI or Xhol) for replacement with another promoter. If desired, modifications 
around the cloning sites can be made by the introduction of sequences that may enhance 
translation. This is particularly useful when overexpression is desired. For example, 
pCGN1761ENX may be modified by optimization of the translational initiation site as 
described in Example 37 of U.S. Patent No. 5,639,949, incorporated herein by reference. 

c. Constitutive Expression, the Actin Promoter: 

Several isoforms of actin are known to be expressed in most cell types and 
consequently the actin promoter is a good choice for a constitutive promoter. In particular, 
the promoter from the rice Act! gene has been cloned and characterized (McElroy et al. 
Plant Cell 2: 163-171 (1990)). A 1.3kb fragment of the promoter was found to contain all 
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the regulatory elements required for expression in rice protoplasts. Furthermore, numerous 
expression vectors based on the ActI promoter have been constructed specifically for use in 
monocotyledons (McElroy et al. Mol Gen. Genet. 231: 150-160 (1991)). These incorporate 
the /\c//-intron 1 , Adhl 5' flanking sequence and Adhl-intron 1 (from the maize alcohol 
dehydrogenase gene) and sequence from the CaMV 35S promoter. Vectors showing 
highest expression were fusions of 35S and ActI intron or the ActI 5' flanking sequence and 
the ActI intron. Optimization of sequences around the initiating ATG (of the GUS reporter 
gene) also enhanced expression. The promoter expression cassettes described by McElroy 
etaL (Mol. Gen. Genet. 231: 150-160 (1991)) can be easily modified for gene expression 
and are particularly suitable for use in monocotyledonous hosts. For example, promoter- 
containing fragments is removed from the McElroy constructions and used to replace the 
double 35S promoter in pCGN1761ENX, which is then available for the insertion of specific 
gene sequences. The fusion genes thus constructed can then be transferred to appropriate 
transformation vectors. In a separate report, the rice ActI promoter with its first intron has 
also been found to direct high expression in cultured barley cells (Chibbar et al. Plant Cell 
Rep. 12: 506-509 (1993)). 

d. Inducible Expression, the PR-1 Promoter: 

The double 35S promoter in pGGN1761ENX may be replaced with any other promoter 
of choice that will result in suitably high expression levels. By way of example, one of the 
chemically regulatable promoters described in U.S. Patent No. 5,614,395 may replace the 
double 35S promoter. The promoter of choice is preferably excised from its source by 
restriction enzymes, but can altematively be PCR-amplified using primers that carry 
appropriate terminal restriction sites. Should PCR-amplification be undertaken, then the 
promoter should be re-sequenced to check for amplification errors after the cloning of the 
amplified promoter in the target vector. The chemically/pathogen regulatable tobacco PR- 
1a promoter is cleaved from plasmid pCIB1004 (for construction, see example 21 of 
EP 0 332 104, which is hereby incorporated by reference) and transferred to plasmid 
PCGN1761ENX (Uknes et al.. 1992). pCIB1004 is cleaved with Ncol and the resultant 3* 
overhang of the linearized fragment is rendered blunt by treatment with T4 DNA 
polymerase. The fragment is then cleaved with Hindlll and the resultant PR-la promoter- 
containing fragment is gel purified and cloned into pCGN1761ENX from which the double 
35S promoter has been removed. This is done by cleavage with Xhol and blunting with T4 
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polymerase, followed by cleavage with Hindlll and isolation of the larger vector-terminator 
containing fragment into which the pCIB1004 promoter fragment is cloned. This generates 
a pCGN1761ENX derivative with the PR-1a promoter and the tml terminator and an 
intervening polylinker with unique EcoRI and NotI sites. The selected coding sequence can 
be inserted into this vector, and the fusion products {i.e, promoter-gene-terminator) can 
subsequently be transferred to any selected transformation vector, including those 
described infra. Various chemical regulators may be employed to induce expression of the 
selected coding sequence in the plants transformed according to the present invention, 
including the benzothiadiazole, isonicotinic acid, and salicylic acid compounds disclosed in 
U.S. Patent Nos. 5,523.31 1 and 5,614.395. 

e. Inducible Expression, an Ethanol-lnducible Promoter: 

A promoter inducible by certain alcohols or ketones, such as ethanol, may also be 
used to confer inducible expression of a coding sequence of the present invention. Such a 
promoter is for example the aIcA gene promoter from Aspergillus nidulans (Caddick et al. 
(1998) Nat Biotechnol 16:177-180). In A. nidulans, the aIcA gene encodes alcohol 
dehydrogenase I, the expression of which is regulated by the AlcR transcription factors in 
presence of the chemical inducer. For the purposes of the present invention, the CAT 
coding sequences in plasmid palcA:CAT comprising a aIcA gene promoter sequence fused 
to a minimal 35S promoter (Caddick et al. (1998) Nat. Biotechnol 16:177-180) are replaced 
by a coding sequence of the present invention to form an expression cassette having the 
coding sequence under the control of the aIcA gene promoter. This is carried out using 
methods well known in the art. 

f. Inducible Expression, a Glucocorticoid-Inducible Promoter: 

Induction of expression of a nucleic acid sequence of the present invention using 
systems based on steroid hormones is also contemplated. For example, a glucocorticoid- 
mediated induction system is used (Aoyama and Chua (1997) The Plant Journal 11: 605- 
612) and gene expression is induced by application of a glucocorticoid, for example a 
synthetic glucocorticoid, preferably dexamethasone, preferably at a concentration ranging 
from 0.1 mM to ImM. more preferably from lOmM to lOOmM. For the purposes of the 
present invention, the luciferase gene sequences are replaced by a nucleic acid sequence 
of the invention to form an expression cassette having a nucleic acid sequence of the 
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invention under the control of six copies of the GAL4 upstream activating sequences fused 
to the 35S minimal promoter. This is carried out using methods well known in the art. The 
trans-acting factor comprises the GAL4 DNA-binding domain (Keegan et al. (1986) Science 
231: 699-704) fused to the transactivating domain of the herpes viral protein VP16 
(Triezenberg et al. (1988) Genes Devei 2: 718-729) fused to the hormone-binding domain 
of the rat glucocorticoid receptor (Picard et al. (1988) Ce//54: 1073-1080). The expression 
of the fusion protein is controlled by any promoter suitable for expression in plants known in 
the art or described here. This expression cassette is also comprised in the plant comprising 
a nucleic acid sequence of the invention fused to the 6xGAL4/minimal promoter. Thus, 
tissue- or organ-specificity of the fusion protein is achieved leading to inducible tissue- or 
organ-specificity of the insecticidal toxin. 

g. Root Specific Expression: 

Another pattern of gene expression is root expression. A suitable root promoter is 
described by de Framond (FEBS 290: 103-106 (1991)) and also in the published patent 
application EP 0 452 269, which is herein incorporated by reference. This promoter is 
transferred to a suitable vector such as pCGN1761 ENX for the insertion of a selected gene 
and subsequent transfer of the entire promoter-gene-terminator cassette to a transfomriation 
vector of interest. 

h. Wound-lnducible Promoters: 

Wound-inducible promoters may also be suitable for gene expression. Numerous 
such promoters have been described (e.gf. Xu et ai Plant Molec. Biol. 22: 573-588 (1993), 
Logemann et ai Plant Cell 1: 151-158 (1989), Rohrmeier & Lehle, Plant Molec. Biol. 22: 
783-792 (1993), Firek et ai Plant Molec. Biol. 22: 129-142 (1993), Warner et ai Plant J. 3: 
191-201 (1993)) and all are suitable for use with the instant invention. Logemann et ai 
describe the 5' upstream sequences of the dicotyledonous potato want gene. Xu et ai 
show that a wound-inducible promoter from the dicotyledon potato (p/n2) is active in the 
monocotyledon rice. Further, Rohrmeier & Lehle describe the cloning of the maize Wipl 
cDNA which is wound induced and which can be used to isolate the cognate promoter using 
standard techniques. Similar, Firek et ai and Warner et ai have described a wound- 
induced gene from the monocotyledon Asparagus officinalis, which is expressed at local 
wound and pathogen invasion sites. Using cloning techniques well known in the art, these 
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promoters can be transferred to suitable vectors, fused to the genes pertaining to this 
invention, and used to express these genes at the sites of plant wounding. 

i. Pith-Preferred Expression: 

Patent Application WO 93/07278, which is herein incorporated by reference, 
describes the isolation of the maize trpA gene, which is preferentially expressed in pith 
cells. The gene sequence and promoter extending up to -1726 bp from the start of 
transcription are presented. Using standard molecular biological techniques, this promoter, 
or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 
35S promoter and be used to drive the expression of a foreign gene in a pith-preferred 
manner. In fact, fragments containing the pith-preferred promoter or parts thereof can be 
transferred to any vector and modified for utility in transgenic plants. 

j. Leaf -Specific Expression: 

A maize gene encoding phosphoenol carboxylase (PEPC) has been described by 
Hudspeth & Grula (Plant Molec Biol 12: 579-589 (1989)). Using standard molecular 
biological techniques the promoter for this gene can be used to drive the expression of any 
gene in a leaf-specific manner in transgenic plants. 

k. Pollen-Specific Expression: 

WO 93/07278 describes the isolation of the maize calcium-dependent protein kinase 
(CDPK) gene which is expressed in pollen cells. The gene sequence and promoter extend 
up to 1400 bp from the start of transcription. Using standard molecular biological 
techniques, this promoter or parts thereof, can be transferred to a vector such as 
pCGN1761 where it can replace the 35S promoter and be used to drive the expression of a 
nucleic acid sequence of the invention in a pollen-specific manner. 

2. Transcriptional Terminators 

A variety of transcriptional terminators are available for use in expression cassettes. 
These are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional terminators are those that are known to 
function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline 
synthase terminator and the pea rbcS E9 terminator. These can be used in both 
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monocotyledons and dicotyledons. In addition, a gene's native transcription terminator may 
be used. 

3. Sequences for the Enhancement or Regulation of Expression 
Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 

Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize AdhI gene have been found 
to significantly enhance the expression of the wild-type gene under its cognate promoter 
when introduced into maize cells. Intron 1 was found to be particulariy effective and 
enhanced expression in fusion constructs with the chloramphenicol acetyltransferase gene 
(Callis etal.. Genes Develop. 1; 1183-1200 (1987)). In the same experimental system, the 
intron from the maize bronzel gene had a similar effect in enhancing expression. Intron 
sequences have been routinely incorporated into plant transformation vectors, typically 
within the non-translated leader. 

A number of non-translated leader sequences derived from viruses are also known to 
enhance expression, and these are particularly effective in dicotyledonous cells. 
Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the "W -sequence"). Maize 
Chlorotic Mottle Virus (MCMV). and Alfalfa Mosaic Virus (AMV) have been shown to be 
effective in enhancing expression {e.g. Gallie et al. Nucl. Acids Res. 15: 8693-8711 (1987); 
Skuzeski etal. Plant Molec. Biol. 15: 65-79 (1990)). 

4. Targeting of the Gene Product Within the Cell 

Various mechanisms for targeting gene products are known to exist in plants and the 
sequences controlling the functioning of these mechanisms have been characterized in 
some detail. For example, the targeting of gene products to the chloroplast is controlled by 
a signal sequence found at the amino terminal end of various proteins which is cleaved 
during chloroplast import to yield the mature protein (e.g. Comai ef a/. J. Biol. Chem. 263 : 
15104-15109 (1988)). These signal sequences can be fused to heterologous gene 
products to effect the import of heterologous products into the chloroplast (van den Broeck. 
et al. Nature 313: 358-363 (1985)). DNA encoding for appropriate signal sequences can be 
isolated from the 5' end of the cDNAs encoding the RUBISCO protein, the CAB protein, the 
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EPSP synthase enzyme, the GS2 protein and many other proteins which are known to be 
chloroplast localized. See also, the section entitled "Expression With Chloroplast Targeting" 
in Example 37 of U.S. Patent No. 5,639,949. 

Other gene products are localized to other organelles such as the mitochondrion and 
the peroxisome {e.g. Unger et aL Plant Molec. Biol. 13: 411-418 (1989)). The cDNAs 
encoding these products can also be manipulated to effect the targeting of heterologous 
gene products to these organelles. Examples of such sequences are the nuclear-encoded 
ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting 
cellular protein bodies has been described by Rogers et al. (Proc. Natl. Acad. Sci. USA 82: 
6512-6516 (1985)). 

In addition, sequences have been characterized which cause the targeting of gene 
products to other cell compartments. Amino terminal sequences are responsible for 
targeting to the ER, the apoplast. and extracellular secretion from aleurone cells (Koehler & 
Ho, Plant Cell 2: 769-783 (1990)). Additionally, amino terminal sequences in conjunction 
with carboxy terminal sequences are responsible for vacuolar targeting of gene products 
(Shinshi etaL Plant Molec. Biol. 14: 357-368 (1990)). 

By the fusion of the appropriate targeting sequences described above to transgene 
sequences of interest it is possible to direct the transgene product to any organelle or cell 
compartment. For chloroplast targeting, for example, the chloroplast signal sequence from 
the RUBISCO gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in 
frame to the amino terminal ATG of the transgene. The signal sequence selected should 
include the known cleavage site, and the fusion constructed should take into account any 
amino acids after the cleavage site which are required for cleavage. In some cases this 
requirement may be fulfilled by the addition of a small number of amino acids between the 
cleavage site and the transgene ATG or, alternatively, replacement of some amino acids 
within the transgene sequence. Fusions constructed for chloroplast import can be tested 
for efficacy of chloroplast uptake by in vitro translation of in vitro transcribed constructions 
followed by in vitro chloroplast uptake using techniques described by Bartlett et al. In: 
Edelmann et al. (Eds.) Methods in Chloroplast Molecular Biology. Elsevier pp 1081-1091 
(1982) and Wasmann et al. Mol. Gen. Genet. 205: 446-453 (1986). These construction 
techniques are well known in the art and are equally applicable to mitochondria and 
peroxisomes. 
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The above-described mechanisms for cellular targeting can be utilized not only in 
conjunction with their cognate promoters, but also in conjunction with heterologous 
promoters so as to effect a specific cell-targeting goal under the transcriptional regulation of 
a promoter that has an expression pattern different to that of the promoter from which the 
targeting signal derives. 

Example 24: Construction of Plant Transformation Vectors 

Numerous transformation vectors available for plant transformation are known to 
those of ordinary skill in the plant transformation arts, and the genes pertinent to this 
invention can be used in conjunction with any such vectors. The selection of vector will 
depend upon the preferred transformation technique and the target species for 
transformation. For certain target species, different antibiotic or herbicide selection markers 
may be preferred. Selection markers used routinely in transformation include the nptll 
gene, which confers resistance to kanamycin and related antibiotics (Messing & Vierra. 
Gene 19: 259-268 (1982); Bevan et al., Nature 304:184-187 (1983)). the bar gene, which 
confers resistance to the herbicide phosphinothricin (White et al., Nucl. Acids Res 18: 1062 
(1990), Spencer et al. Theor. Appl. Genet 79: 625-631 (1990)). the hph gene, which confers 
resistance to the antibiotic hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929- 
2931). and the dhfr gene, which confers resistance to methatrexate (Bourouis et al.. EMBO 
J. 2IZ}: 1099-1104 (1983)). and the EPSPS gene, which confers resistance to glyphosate 
(U.S. Patent Nos. 4.940,935 and 5.188.642). 

1 . Vectors Suitable for Agrobactehum Transformation 

Many vectors are available for transformation using Agrobactehum tumefaciens. 
These typically carry at least one T-DNA border sequence and include vectors such as 
pBIN19 (Bevan. Nucl. Acids Res. (1984)) and pXYZ. Below, the construction of two typical 
vectors suitable for Agrobacterium transformation is described. 

a. PCIB200 and pCiB2001 : 

The binary vectors pclB200 and pCIB2001 are used for the construction of 
recombinant vectors for use with Agrobacterium and are constructed in the following 
manner. pTJS75kan is created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. 
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Bacteriol. 164- 446-455 (1985)) allowing excision of the tetracycline-resistance gene, 
followed by insertion of an /\cc/ fragment from pUC4K carrying an NPTII (Messing & Vierra. 
Gene 19: 259-268 (1982): Bevan et aL, Nature 304: 184-187 (1983): McBride et al., Plant 
Molecular Biology 14: 266-276 (1990)). Xhol linkers are ligated to the Ecofl\/ fragment of 
PC1B7 which contains the left and right T-DNA borders, a plant selectable nos/npf// chimeric 
gene and the pUC polylinker (Rothstein et al.. Gene 53: 153-161 (1987)), and the Xhol- 
digested fragment are cloned into Sa//-digested pTJS75kan to create pCIB200 (see also EP 
0 332 104, example 19). pCIB200 contains the following unique polylinker restriction sites: 
EcoRI, SstI, Kpnl, Bglll, Xbal, and SalL pCIB2001 is a derivative of pCIB200 created by the 
insertion into the polylinker of additional restriction sites. Unique restriction sites in the 
polylinker of pCIB2001 are EcoRI, SstI, Kpnl, Bglll, Xbal, Sail, Mlul, Bell, Avrll, Apal, Hpal, 
and Stul. pCIB2001 , in addition to containing these unique restriction sites also has plant 
and bacterial kanamycin selection, left and right T-DNA borders for AgrobacteriunhmB<X\BXe6 
transformation, the RK2-derived trfA function for mobilization between E. coli and other 
hosts, and the Or/Tand Or/V functions also from RK2. The pCIB2001 polylinker is suitable 
for the cloning of plant expression cassettes containing their own regulatory signals. 

b. pCIBI 0 and Hygromycin Selection Derivatives thereof: 

The binary vector pCIBIO contains a gene encoding kanamycin resistance for 
selection in plants and T-DNA right and left border sequences and incorporates sequences 
from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and 
Agrobacterium. Its construction is described by Rothstein etaL (Gene 53: 153-161 (1987)). 
Various derivatives of pCIBIO are constructed which incorporate the gene for hygromycin B 
phosphotransferase described by Gritz etaL (Gene 25: 179-188 (1983)). These derivatives 
enable selection of transgenic plant cells on hygromycin only (pCIB743), or hygromycin and 
kanamycin (pCIB715. pCIB717). 

2. Vectors Suitable for non-Agrobacterium Transformation 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transformation vector and consequently 
vectors lacking these sequences can be utilized in addition to vectors such as the ones 
described above which contain T-DNA sequences. Transformation techniques that do not 
rely on Agrobacterium include transformation via particle bombardment, protoplast uptake 
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(e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on 
the preferred selection for the species being transformed. Below, the construction of typical 
vectors suitable for non-Agrobacterium transformation is described. 

a. pCIB3064: 

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in 
combination with selection by the herbicide basta (or phosphinothricin). The plasmid 
pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene 
and the CaMV 35S transcriptional terminator and is described in the PCT published 
application WO 93/07278. The 35S promoter of this vector contains two ATG sequences 5* 
of the start site. These sites are mutated using standard PCR techniques in such a way as 
to remove the ATGs and generate the restriction sites Sspl and Pvull. The new restriction 
sites are 96 and 37 bp away from the unique Sail site and 101 and 42 bp away from the 
actual start site. The resultant derivative of pCIB246 is designated pCIB3025. The GUS 
gene is then excised from pCIB3025 by digestion with Sail and Sac/, the termini rendered 
blunt and religated to generate plasmid pCIB3060. The plasmid pJIT82 Is obtained from the 
John Innes Centre, Nonwich and the a 400 bp S/na/ fragment containing the dar gene from 
Streptomyces viridochromogenes is excised and inserted into the Hpal site of pCIB3060 
(Thompson et ai EMBO J 6: 2519-2523 (1987)). This generated pCIB3064, which 
comprises the bar gene under the control of the CaMV 35S promoter and terminator for 
herbicide selection, a gene for ampicillin resistance (for selection in E. coli) and a polylinker 
with the unique sites SphI, PstI, Hindlll, and BamHL This vector is suitable for the cloning 
of plant expression cassettes containing their own regulatory signals. 

b. pSOG19 and pSOG35: 

pSOG35 is a transformation vector that utilizes the E. coll gene dihydrofolate 
reductase (DFR) as a selectable marker conferring resistance to methotrexate. PCR is 
used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adh1 gene (-550 bp) 
and 18 bp of the GUS untranslated leader sequence from pSOGIO. A 250-bp fragment 
encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and 
these two PCR fragments are assembled with a Sac/-Psf/ fragment from pB1221 (Clontech) 
which comprises the pUC19 vector backbone and the nopaline synthase terminator. 
Assembly of these fragments generates pSOG19 which contains the 35S promoter in fusion 
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with the intron 6 sequence, the GUS leader, the DHFR gene and the nopaline synthase 
terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from 
Maize Chlorotic Mottle Virus (MCMV) generates the vector pSOG35. pSOG19 and pSOG35 
carry the pUC gene for ampicillin resistance and have Hindlll, SphI, PstI and EcoRI sites 
available for the cloning of foreign substances. 

Example 25: Transformation 

Once a nucleic acid sequence of the invention has been cloned into an expression 
system, It is transformed into a plant cell. Methods for transformation and regeneration of 
plants are well known in the art. For example, Ti plasmid vectors have been utilized for the 
delivery of foreign DNA, as well as direct DNA uptake, liposomes, electroporation, micro- 
injection, and microprojectiles. In addition, bacteria from the genus Agrobacterium can be 
utilized to transform plant cells. Below are descriptions of representative techniques for 
transforming both dicotyledonous and monocotyledonous plants. 

1 . Transformation of Dicotyledons 

Transformation techniques for dicotyledons are well known in the art and include 
Agrobacterium-based techniques and techniques that do not require Agrobacterium. Non- 
Agrobacterium techniques involve the uptake of exogenous genetic material directly by 
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques 
are described by Paszkowski ef a/., EMBO J 3: 2717-2722 (1984). Potrykus et al„ Mol. Gen. 
Genet. 199: 169-177 (1985). Reich ef a/.. Biotechnology 4: 1001-1004 (1986), and Klein et 
aL, Nature 327: 70-73 (1987). In each case the transformed cells are regenerated to whole 
plants using standard techniques known in the art. 

Agrobacterium-mediated transformation is a preferred technique for transformation of 
dicotyledons because of its high efficiency of transformation and its broad utility with many 
different species. Agrobacterium transformation typically involves the transfer of the binary 
vector carrying the foreign DNA of interest {e.g. pCIB200 or pCIB2001) to an appropriate 
Agrobacterium strain which may depend of the complement of wr genes carried by the host 
Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g. strain 
CIB542 for pCIB200 and pCIB2001 (Uknes et al. Plant Cell 5: 159-169 (1993)). The 
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transfer of the recombinant binary vector to Agrobacterium is accomplished by a triparental 
mating procedure using E. co// carrying the recombinant binary vector, a helper E. co// strain 
which carries a plasmid such as pRK2013 and which is able to mobilize the recombinant 
binary vector to the target Agrobacterium strain. Alternatively, the recombinant binary 
vector can be transferred to Agrobacteriun) by DNA transformation (Hofgen & Willmitzer, 
Nucl. Acids Res. 16: 9877 (1988)). 

Transformation of the target plant species by recombinant Agrobacterium usually 
involves co-cultivation of the Agrobacterium with explants from the plant and follows 
protocols well known in the art. Transformed tissue is regenerated on selectable medium 
carrying the antibiotic or herbicide resistance marker present between the binary plasmid T- 
DNA borders. 

Another approach to transforming plant cells with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U.S. 
Patent Nos. 4,945.050, 5,036,006, and 5,100,792 all to Sanford et al. Generally, this 
procedure involves propelling inert or biologically active particles at the cells under 
conditions effective to penetrate the outer surface of the cell and afford incorporation within 
the interior thereof. When inert particles are utilized, the vector can be introduced into the 
cell by coating the particles with the vector containing the desired gene. Alternatively, the 
target cell can be surrounded by the vector so that the vector is carried into the cell by the 
wake of the particle. Biologically active particles (e.g., dried yeast cells, dried bacterium or a 
bacteriophage, each containing DNA sought to be introduced) can also be propelled into 
plant cell tissue. 

2. Transformation of Monocotyledons 

Transformation of most monocotyledon species has now also become routine. 
Preferred techniques include direct gene transfer into protoplasts using PEG or 
electroporation techniques, and particle bombardment into callus tissue. Transformations 
can be undertaken with a single DNA species or multiple DNA species (/.e. co- 
transformation) and both these techniques are suitable for use with this invention. Co- 
transformation may have the advantage of avoiding complete vector construction and of 
generating transgenic plants with unlinked loci for the gene of interest and the selectable 
marker, enabling the removal of the selectable marker in subsequent generations, should 
this be regarded desirable. However, a disadvantage of the use of co-transformation is the 



BNSDOCID: <WO 9942589A2_I_> 



wo 99/42589 



PCT/EP99/01015 



-61 - 

less than 100% frequency with which separate DNA species are integrated into the genome 
(Schocher etal. Biotechnology 4: 1093-1096 (1986)). 

Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe 
techniques for the preparation of callus and protoplasts from an elite inbred line of maize, 
transformation of protoplasts using PEG or electroporation, and the regeneration of maize 
plants from transformed protoplasts. Gordon-Kamm et al. (Plant Cell 2: 603-618 (1990)) 
and Fromm et al. (Biotechnology 8: 833-839 (1990)) have published techniques for 
transformation of A188-derived maize line using particle bombardment. Furthermore, 
WO 93/07278 and Koziel etal. (Biotechnology 11: 194-200 (1993)) describe techniques for 
the transformation of elite inbred lines of maize by particle bombardment. This technique 
utilizes immature maize embryos of 1 .5-2.5 mm length excised from a maize ear 14-15 days 
after pollination and a PDS-IOOOHe Biolistics device for bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques 
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been 
described for Japon/ca-types and /nd/ca-types (Zhang et ai Plant Cell Rep 7: 379-384 
(1988); Shimamoto etai Nature 338: 274-277 (1989); Datta etaL Biotechnology 8: 736-740 
(1990)). Both types are also routinely transformable using particle bombardment (Christou 
et aL Biotechnology 9: 957-962 (1991)). Furthermore, WO 93/21335 describes techniques 
for the transformation of rice via electroporation. 

Patent Application EP 0 332 581 describes techniques for the generation, 
transformation and regeneration of Pooideae protoplasts. These techniques allow the 
transformation of Dactylls and wheat. Furthermore, wheat transfonmation has been 
described by Vasil et al. (Biotechnology 10: 667-674 (1992)) using particle bombardment 
into cells of type C long-term regenerable callus, and also by Vasil et al. (Biotechnology IJ.: 
1553-1558 (1993)) and Weeks etaL (Plant Physiol. 102: 1077-1084 (1993)) using particle 
bombardment of immature embryos and immature embryo-derived callus. A preferred 
technique for wheat transformation, however, involves the transformation of wheat by 
particle bombardment of immature embryos and includes either a high sucrose or a high 
maltose step prior to gene delivery. Prior to bombardment, any number of embryos (0.75-1 
mm in length) are plated onto MS medium with 3% sucrose (Murashiga & Skoog, 
Physiologia Plantarum 15: 473-497 (1962)) and 3 mg/l 2,4-D for induction of somatic 
embryos, which is allowed to proceed in the dark. On the chosen day of bombardment, 
embryos are removed from the induction medium and placed onto the osmoticum (/.e. 
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induction medium with sucrose or maltose added at the desired concentration, typically 
15%). The embryos are allowed to plasmolyze for 2-3 h and are then bombarded. Twenty 
embryos per target plate is typical, although not critical. An appropriate gene-carrying 
plasmid (such as pCIB3064 or pSG35) is precipitated onto micrometer size gold particles 
using standard procedures. Each plate of embryos is shot with the DuPont Biolistics® 
helium device using a burst pressure of -1000 psi using a standard 80 mesh screen. After 
bombardment, the embryos are placed back into the dark to recover for about 24 h (still on 
osmoticum). After 24 hrs, the embryos are removed from the osmoticum and placed back 
onto induction medium where they stay for about a month before regeneration. 
Approximately one month later the embryo expiants with developing embryogenic callus are 
transferred to regeneration medium (MS + 1 mg/liter NAA, 5 mg/liter GA), further containing 
the appropriate selection agent (10 mg/l basta in the case of pCIB3064 and 2 mg/l 
methotrexate in the case of pSOG35). After approximately one month, developed shoots 
are transferred to larger sterile containers known as "GA7s" which contain half-strength MS, 
2% sucrose, and the same concentration of selection agent. 

Tranformation of monocotyledons using Agrobacterium has also been described. 
See, WO 94/00977 and U.S. Patent No. 5,591.616. both of which are incorporated herein 
by reference. 

C Breeding and Seed Production 

Example 26: Breeding 

The plants obtained via tranformation with a nucleic acid sequence of the present 
invention can be any of a wide variety of plant species, including those of monocots and 
dicots; however, the plants used in the method of the invention are preferably selected from 
the list of agronomically important target crops set forth supra. The expression of a gene of 
the present invention in combination with other characteristics important for production and 
quality can be incorporated into plant lines through breeding. Breeding approaches and 
techniques are known in the art. See, for example. Welsh J. R., Fundamentals of Plant 
Genetics and Breeding, John Wiley & Sons, NY (1981); Crop Breeding, Wood D. R. (Ed.) 
American Society of Agronomy Madison, Wisconsin (1983); Mayo O.. The Theory of Plant 
Breeding, Second Edition. Clarendon Press, Oxford (1987); Singh, D.P.. Breeding for 
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Resistance to Diseases and Insect Pests, Springer-Verlag. NY (1986); and Wricke and 
Weber, Quantitative Genetics and Selection Plant Breeding, Walter de Gruyter and Co.. 
Berlin (1986). 

The genetic properties engineered into the transgenic seeds and plants described 
above are passed on by sexual reproduction or vegetative growth and can thus be 
maintained and propagated in progeny plants. Generally said maintenance and propagation 
make use of known agricultural methods developed to fit specific purposes such as tilling, 
sowing or harvesting. Specialized processes such as hydroponics or greenhouse 
technologies can also be applied. As the growing crop is vulnerable to attack and damages 
caused by insects or infections as well as to competition by weed plants, measures are 
undertaken to control weeds, plant diseases, insects, nematodes, and other adverse 
conditions to improve yield. These include mechanical measures such a tillage of the soil or 
removal of weeds and infected plants, as well as the application of agrochemicals such as 
herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents and 
insecticides. 

Use of the advantageous genetic properties of the transgenic plants and seeds 
according to the invention can further be made in plant breeding, which aims at the 
development of plants with improved properties such as tolerance of pests, herbicides, or 
stress, improved nutritional value, increased yield, or improved structure causing less loss 
from lodging or shattering . The various breeding steps are characterized by well-defined 
human intervention such as selecting the lines to be crossed, directing pollination of the 
parental lines, or selecting appropriate progeny plants. Depending on the desired 
properties, different breeding measures are taken. The relevant techniques are well known 
in the art and include but are not limited to hybridization, inbreeding, backcross breeding, 
multiline breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. 
Hybridization techniques also include the sterilization of plants to yield male or female 
sterile plants by mechanical, chemical, or biochemical means. Cross pollination of a male 
sterile plant with pollen of a different line assures that the genome of the male sterile but 
female fertile plant will uniformly obtain properties of both parental lines. Thus, the 
transgenic seeds and plants according to the invention can be used for the breeding of 
improved plant lines, that for example, increase the effectiveness of conventional methods 
such as herbicide or pestidice treatment or allow one to dispense with said methods due to 
their modified genetic properties. Alternatively new crops with improved stress tolerance can 
be obtained, which, due to their optimized genetic "equipment", yield harvested product of 
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better quality than products that were not able to tolerate comparable adverse 
developmental conditions. 

Example 27: Seed Production 

In seed production, germination quality and uniformity of seeds are essential product 
characteristics, whereas germination quality and uniformity of seeds han/ested and sold by 
the farmer is not important. As it is difficult to keep a crop free from other crop and weed 
seeds, to control seedborne diseases, and to produce seed with good germination, fairly 
extensive and well-defined seed production practices have been developed by seed 
producers, who are experienced in the art of growing, conditioning and marketing of pure 
seed. Thus, it is common practice for the farmer to buy certified seed meeting specific 
quality standards instead of using seed harvested from his own crop. Propagation material 
to be used as seeds is customarily treated with a protectant coating comprising herbicides, 
insecticides, fungicides, bactericides, nematicides. molluscicides, or mixtures thereof. 
Customarily used protectant coatings comprise compounds such as captan. carboxin, 
thiram (TMTD*). methalaxyl (Apron®), and pirimiphos-methyl (Actellic^). If desired, these 
compounds are formulated together with further carriers, surfactants or application- 
promoting adjuvants customarily employed in the art of formulation to provide protection 
against damage caused by bacterial, fungal or animal pests. The protectant coatings may 
be applied by Impregnating propagation material with a liquid formulation or by coating with 
a combined wet or dry formulation. Other methods of application are also possible such as 
treatment directed at the buds or the fruit. 

It is a further aspect of the present invention to provide new agricultural methods, 
such as the methods examplified above, which are characterized by the use of transgenic 
plants, transgenic plant material, or transgenic seed according to the present invention. 

The seeds may be provided in a bag, container or vessel comprised of a suitable 
packaging material, the bag or container capable of being closed to contain seeds. The 
bag, container or vessel may be designed for either short tenn or long temi storage, or both, 
of the seed. Examples of a suitable packaging material include paper, such as kraft paper, 
rigid or pliable plastic or other polymeric material, glass or metal. Desirably the bag, 
container, or vessel is comprised of a plurality of layers of packaging materials, of the same 
or differing type. In one embodiment the bag, container or vessel is provided so as to 
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exclude or limit water and moisture from contacting the seed. In one example, the bag, 
container or vessel is sealed, for example heat sealed, to prevent water or moisture from 
entering. In another embodiment water absorbent materials are placed between or 
adjacent to packaging material layers. In yet another embodiment the bag, container or 
vessel, or packaging material of which it is comprised is treated to limit, suppress or 
prevent disease, contamination or other adverse affects of storage or transport of the seed. 
An example of such treatment is sterilization, for example by chemical means or by 
exposure to radiation. Comprised by the present invention is a commercial bag comprising 
seed of a transgenic plant comprising a gene of the present invention that is expressed in 
said transformed plant at higher levels than in a wild type plant, together with a suitable 
carrier, together with label instructions for the use thereof for conferring broad spectrum 
disease resistance to plants. 
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* Squlvalent ' Kot equivalent to depoalt on /- A-^Q (Date) 

Signature of Depositor 



TV- CONDITIOH3 UNDBR WHICH THK VIABILITY TBST WAB PBRPORMBD (Deoosltore /Deoosltary) ' 



IV. CpNDITXOWS UWOl 

TKfi. d<-i4d Cuii7<Jr^ u 



\ WHICH THK VIABILITY TBST WAB PBRPORMBD (Deoosi 

King.. Sam^ lifui^i OuffurcTZjo^ s4tp<x 



, <^rown ajh 3"7»C 



V. IWTBRHATIONAL DtPOSITARY AUTHORITY 



HAinet Agricultural Reeearoh Culture 
collection (NRRZ.) 
International Depositary Authority 

Address t 1815 N. University Street 
WPifl, TUtneHf ^1W4 ?^Pr^. 



81gnatttre<8) of pereonfs) having che power 
to repreaent the International Depoeltary 
Authority or of authorised ef£lolal(e>i 



^ Indie«ca th« ^%xm of Ch« ociginal dttpo«lc or wA«n a naw dvpofl&c haa hmmn awd*. 

• nar^ wich • cromm %,f\m appi^Mbl* boa. 

* Tn CA* eaaaa ratecxao to in RyXa lO.ZfaHllI and e«t«c to cha moat xaeanc viability case, 
f r^ix in i< tha inloamacien naa oaan caqiiojtoO. 
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BXmPBST TRSATT ON THB XKT&R2OITXC3B01L 
RSOOGSZTZOK OP THS DEPOSIT OF MTrPQopgni^TCtMS 
FOR THB PDRPOSS OP PATBRT PROCSDDRBS 



ZUTSSNATIONAL FQRU 



TO 

Novarcis Corp . 
c/o Movarris AO 
p. O. Box 12257 

Reaoarcn Triangle Park, NC 27709 


RECEIPT 129 THE CASE OF AN ORIGIK3VL DEPOSIT 
iaauad pursuant to Rule 7.1 by the 
I21TERKATIONAI* DEPOSITARY AUTKORITV 
identified at the boccom of this page 


MRUS AMD ADORSSS 
OF DEPOSITOR 




I . IDBRTIPTCATIOM OF THE MICROORCRMISM 


Idancificacion reference given by the 
DEPOSITOR: 


Aeeeaeion number given by che 
INTERKATXOZUU. DEPOSITARY AUTHORITY: 


Bacteria sp. pCXB 9359-7 


MRRI* 3*21835 



II. SCIBinTFrC DESCRIPTION AHD/OR PROPOSED TAXOEDMIC DBSIOKATIOW 



The microorganiam identified under I. above waa accompanied by: 
a scientific deacription 



I I a propoeed taxonomic deeignacion 

(Marfe with a crosa where apoli cable) 
III. RECEIPT AND ACCEPTANCE 



This intoroationai Depositary Authority acoapts the microorganism identified under I. 
above, which was received by it on September 17, 1997 (dare of the original deposit)' 



IV. RECEIPT OP RBQ0S3T FOR CONVERSION 

The microorganism Identified under I. above was received by this International 
Depositary Authority on (date of the original deposit) and a request 

to convert the original deposit to a deposit under the Budapest Treaty was received by 
it on (date of receipt of request for conversion) . 



INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Agricultural Research Culture 
Collection (NRRli) 
International Depositary Authority 

Address : 1B15 N. Oniversity Street 

Peoria, Illinois 61604 U.S.A. 



Signature is) of parson is) having the power 
to represent the International Depositaryr 
Authority or of authorised official (s): 



Date : 



Where Rule 6.4 <d) applies, such date is the date on which the status of intamational 
depositary authority was acquired. 
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BDQAPBST TPEKTt ON TOB XXITBSaiATXOSIUU iZSCOOHZTIOtl OP TKB DEPOSIT 
OP HZCROaaBABIfiMS FOR TRS PDRPQ8B OF P&TBBT PROCBDOSBS 



TO 

Hovarcia Corp. 
c/o Novartis AO 
P. O. Boob 11357 

Raaeareh TrlangXe P&rk, HC 27709 
HAMS iUID ADDBB66 OP IBS PARTY TO WHDU 

THg viAaii»mf HTATHMBtrr is issokp 



YTMTTiTTy aTATBMRTTT 

pursiiAnt to Rule 10.2 by ch« 
iRTSBKATIOflAI* OEPOSITARY AtSTHOimY 
ideatlflad at t^ia boccom o£ chla page 



DgpogiTOR 



Hame: Hovarcia Cozp 

e/o Movarcia AO 

Addreas: P. O. Box 12257 

Baaaarch Triangle Parle. 
WC 2770P 



II. rpswriFicAnow of thb MicaooBGAansM 



Dapoaitor*8 taxcmcmic daaignacion and 
aeeaaeion number given by the 
UTBRNATIOJSAL OEPOSZXARY AXTTHORITY: 
Baocaria ap. NBBXi B- 21835 

Dace of :Saptainbar 17, 1997 
Original Oepoait 

□ ' Efew Depoail: 

□ * Rapropagati.on ot Original Dapoait 



III. 



(a) VIABII^ITY 9TATgMHOT 



Deposit vaa found: 



Viable 



□ 



Booviabla on September 18^ 1997 (Data) 



International Depoaitary Authority's preparation vaa found viaUsle on September 25, 
1997 (Date) ^ 



III. 



(b) DHPOSrrOR'S equivalsnct DEcuuwnow 



Daposxcor determined the International Depoaitary Authority's preparation vaa 

□ - Equivalent Not equivalent to depoait on (Date) 

Signature of Depositor , 



IV. CONPmOWS X3HDKR WHICH THK VIABILITY TEST WAS PBRFORMBD (DeooBitora/Deooaitarv) ^ 



INTKRHATIDNAL DEPOSITARY AOTHQRITY 



Name: 



Agricultural Research Culture 

Collection (NRRI.) 
International Depoaitary Authority 



Addreaa; 1815 N. University Street 
Peoria. IlllBoia 61€04 O.S.A 



Signature (a) o£ peraon(B) having the power 
to represent cho International Depositary 
Authority or of authorxzed official is) : 

/ -7 



JSft£« 



/I' O 77 



indicate ch* tfaM of ths oci.9inai d«poai.c or wnsn a n«>«f aopoalC r.uj boon nufie. 
Hirk with a cro«0 th« «npi|r:«»*i« fM»s. 

In cito oa»«« rotocr«ci to in ftuio in.2|.i||M.| ^n^* Mill, rntnr tr. r.hu munc •.tiCMnc vi,«bXXXCy CkflC. 
iili Xn it tho intonnatloA haa boon coquoytvcl. 
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What is claimed is: 

1 . An isoiated nucleic acid molecule comprising: 

(a) a nucleotide sequence substantially similar to a nucleotide sequence 
selected from the group consisting of: nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID 
NO:1. nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of 
SEQ ID NO:1, nucleotides 15.171-18,035 of SEQ ID NO:11, and nucleotides 
31 ,393-35,838 of SEQ ID NO:1 1 ; 

(b) a nucleotide sequence comprising nucleotides 23,768-31,336 of SEQ ID 
NO:11;or 

(c) a nucleotide sequence isocoding with the nucleotide sequence of (a) or (b); 
wherein expression of said nucleic acid molecule results in at least one toxin that is active 
against insects. 

2. An isolated nucleic acid molecule comprising a 20 base pair nucleotide portion 
identical in sequence to a consecutive 20 base pair nucleotide portion of a nucleotide 
sequence selected from the group consisting of: nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of SEQ ID NO:1, 
nucleotides 15,171-18.035 of SEQ ID NO:11, and nucleotides 31,393-35.838 of SEQ ID 
NO:11, wherein expression of said nucleic acid molecule results in at least one toxin that is 
active against insects. 

3. An isolated nucleic acid molecule comprising a nucleotide sequence from 
Photorhabdus luminescens selected from the group consisting of: nucleotides 412-1665 of 
SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1. nucleotides 2758-3318 of SEQ ID 
NO:1, nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of SEQ ID NO:1. 
nucleotides 66-1898 of SEQ ID NO:11, nucleotides 2416-9909 of SEQ ID NO:11, the 
complement of nucleotides 2817-3395 of SEQ ID NO:11, nucleotides 9966-14.633 of SEQ 
ID NO:11. nucleotides 14.699-15.007 of SEQ ID NO:11. nucleotides 15,171-18.035 o< SEQ 
ID NO:1 1 , the complement of nucleotides 17,072-17,398 of SEQ ID NO:1 1 , the complement 
of nucleotides 18,235-19.167 of SEQ ID NO:11, the complement of nucleotides 19,385- 
20,116 of SEQ ID NO:11, the complement of nucleotides 20,217-20,963 of SEQ ID NO:11. 
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the complement of nucleotides 22,172-23,086 of SEQ ID NO:11, nucleotides 23,768-31,336 
of SEQ ID NO:11, nucleotides 31.393-35,838 of SEQ ID NO:11, the complement of 
nucleotides 35,383-35,709 of SEQ ID NO:1 1 , the complement of nucleotides 36,032-36,661 
of SEQ ID NO:1 1, and the complement of nucleotides 36.654-37,781 of SEQ ID NO:11. 

4. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence is substantially similar to nucleotides 412-1665 of SEQ ID NO:1, nucleotides 
1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342- 
41 18 of SEQ ID NO:1 . or nucleotides 4515-9269 of SEQ ID NO:1 . 

5. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes an amino acid sequence selected from the group consisting of SEQ ID 
NOs:2-6. 

6. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence comprises nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ 
ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, 
or nucleotides 4515-9269 of SEQ ID NO:1 . 

7. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence is substantially similar to nucleotides 15,171-18,035 or 31,393-35,838 of SEQ ID 
NO:11. 

8. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes the amino acid sequence set forth in SEQ ID NOs:12-14. 

9. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence comprises nucleotides 15,171-18,035; 23,768-31,336; or 31,393-35,838 of SEQ 
ID NO:11. 

10. An isolated nucleic acid molecule according to claim 2, comprising a 20 base pair 
nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion of 
nucleotides 412-1665 of SEQ ID NO:1 . nucleotides 1686-2447 of SEQ ID NO:1 , nucleotides 
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2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1. or nucleotides 4515- 
9269 of SEQ ID NO:1. 

11. An isolated nucleic acid molecule according to claim 2, comprising a 20 base pair 
nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion of 
nucleotides 15,171-18,035 or 31 .393-35,838 of SEQ ID NO:1 1 . 

12. A chimeric gene comprising a heterologous promoter sequence operatively linked to 
the nucleic acid molecule of claim 1 or claim 2. 

13. A recombinant vector comprising the chimeric gene of claim 12. 

14. A host cell comprising the chimeric gene of claim 12. 

15. A host cell according to claim 14, which is a bacterial cell. 

16. A-host cell according to claim 14, which is a yeast cell. 

17. A host cell according to claim 14, which is a plant cell. 

18. A plant comprising the plant cell of claim 17. 

19. A plant according to claim 18, which is maize. 

20. A toxin produced by the expression of a DNA molecule according to claim 1 or claim 
2. 

21. A toxin according to claim 20, wherein said toxin has activity against Lepidopteran 
insects. 

22. A toxin according to claim 21, wherein said toxin has activity against Plutella xylostella 
(Diamondback Moth). Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Com 
Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), 
Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall Armyworm). 
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23. A toxin according to claim 20, wherein said toxin has activity against Lepidopteran and 
Coleopteran insects. 

24. A toxin according to claim 23, wherein said toxin has insecticidal activity against 
Plutella xylostella (Diamondback Moth), Ostrinia nubilalis (European Corn Borer), and 
Manduca sexta (Tobacco Hornworm), Diabrotica virgifera virgifera (Western Com 
Rootworm), Diabrotica undecimpunctata howardi (Southern Corn Rootworm). and 
Leptinotarsa decimiineata (Colorado Potato Beetle). 

25. A toxin according to claim 20, wherein said toxin comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:2-6. 

26. A toxin according to claim 20. wherein said toxin comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:12-14. 

27. A composition comprising an insecticidally effective amount of a toxin according to 
claim 20. 

28. A method of producing a toxin that is active against insects, comprising: 

(a) obtaining the host cell of claim 14; and 

(b) expressing the nucleic acid molecule in said cell, which results in at least one 
toxin that is active against insects. 

29. A method of producing an insect-resistant plant, comprising introducing a nucleic acid 
molecule according to claim 1 into said plant, wherein said nucleic acid molecule is 
expressible in said plant in an effective amount to control insects. 

30. A method of controlling insects comprising delivering to the insects an effective 
amount of a toxin according to claim 44. 

31 . The method of claim 29 or claim 30, wherein the insects are Lepidopteran insects. 
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32. The method of claim 31 , wherein the insects are selected from the group consisting 
of: Plutella xylostella (Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia 
nubilalis (European Com Borer), Heliothis virescens (Tobacco Budworm). Helicoverpa zea 
(Com Eanwonm), Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall 
Armyworm). 

33. The method of claim 29 or claim 30, wherein the insects are Lepidopteran and 
Coleopteran insects. 

34. The method of claim 33, wherein the insects are selected from the group consisting 
of: Plutella xylostella (Diamondback Moth), Ostrinia nubilalis (European Corn Borer), and 
Manduca sexta (Tobacco Hornworm), Diabrotica virgifera virgifera (Western Corn 
Rootworm), Diabrotica undecimpunctata howardi (Southern Corn Rootworm), and 
Leptinotarsa decimlineata (Colorado Potato Beetle). 

35. The method of claim 30, wherein the toxin is delivered to the insects orally. 

36. A method for mutagenizing a nucleic acid molecule according to claim 1 , wherein the 
nucleic acid molecule has been cleaved into population of double-stranded random 
fragments of a desired size, comprising: 

(a) adding to the population of double-stranded random fragments one or more 
single- or double-stranded oligonucleotides, wherein said oligonucleotides 
each comprise an area of identity and an area of heterology to a double- 
stranded template polynucleotide; 

(b) denaturing the resultant mixture of double-stranded random fragments and 
oligonucleotides into single-stranded fragments; 

(c) incubating the resultant population of single-stranded fragments with a 
polymerase under conditions which result in the annealing of said single- 
stranded fragments at said areas of identity to form pairs of annealed 
fragments, said areas of identity being sufficient for one member of a pair to 
prime replication of the other, thereby forming a mutagenized double- 
stranded polynucleotide; and 
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(d) repeating the second and third steps for at least two further cycles, wherein 
the resultant mixture in the second step of a further cycle includes the 
mutagenized double-stranded polynucleotide from the third step of the 
previous cycle, and wherein the further cycle forms a further mutagenized 
double-stranded polynucleotide. 
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SEQUENCE 

<110> Novartis AG 

<120> Novel Toxins And Uses lliereof 

<130> PI/5-30421/A/CX3C 1963 

<140> 
<141> 

<160> 22 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 9717 
<212> ENA 

<213> Etetorhabdus luminescens 

<220> 
<221> CDS 

<222> (412) . . (1665) 
<223> orfl -46.4 kDa 

<220> 
<221> CDS 

<222> (1686) . . (2447) 
<223> orf2 -28.1kDa 

<220> 
<221> CDS 

<222> (2758) . , (3318) 
<223> orf3 --20.7 kDa 

<220> 
<221> CDS 

<222> (3342) . . (4118) 
<223> orf4 -28.7 3<Da 

<220> 
<221> CDS 

<222> (4515) . . (9269) 
<223> orfS -176 kDa 

<400> 1 

gaattcatat gctatgaaat aaacagttgg cgcaataatt aaagctatta tttttatttt 60 

gtttttatac aatgatatgc tttattaaac agaataatga gttaatgata aataaatcct: 120 

cgggatittat catgatatta tggccgaatg tgatgtgaac aattatttta taattagatt 180 

aataatataa tggtattaaa ataacaatat atttattcat gggtatttat catcggtttt 240 

attacacggg gaataatcta taaattagtt ttacataatt cacaaatagc gattccatta 300 

accaggaata tcaaaaatac ttatttatga ttatggtgat atatcttcat tagcctactt 360 

ttataactag aaaaattgac attttcaatc catgtataaa tggtaaccaa t atg cag 417 

Met Gin 
1 

aga get caa cga gtt gtt att aca ggt atg ggt gcc gta aca ccg att 465 
Arg Ala Gin Arg Val Val lie Ihr Gly Met Gly Ala Val Ihr Pro He 
5 10 15 
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ggt gaa gat gtt gaa tea tgt tgg caa agt. att att gaa aaa caa cat 513 
Gly Glu Asp Val Glu Ser Cys Trp Gin Ser lie lie Glu Lys Gin His 
20 25 30 

cga ttt cac aga att gaa ttt cct gac tea ttc att aat teg cgt ttc 561 
Arg Phe His Arg lie Glu Phe Pro Asp Ser Phe lie Asn Ser Arg Phe 
35 40 45 50 

ttt tct ttc ctt gca cea aac cea tee egc tat eag tta tta cea aaa 609 
Phe Ser Phe Leu Ala Pro Asn Pro Ser Arg Tyr Gin Leu Leu Pro Lys 
55 60 65 

aag ttg act eat aca ctt tct gac tgc gga aaa gca gca ttg aag gcg 657 
Lys Leu Thr His Thr Leu Ser Asp Cys Gly Lys Ala Ala Leu Lys Ala 
70 75 80 

act tat caa get ttt ace caa gca ttc ggc gtg aat ata tea cct gtt 705 
Tta" Tyr Gin Ala Phe Thr Gin Ala Phe Gly Val Asn lie Ser Pro Val 
85 90 95 

gaa tat tac gat aaa tac gaa tgt ggc gta att ctt ggc agt ggt tgg 753 
Glu Tyr Tyr Asp Lys Tyr Glu Cys Gly Val lie Leu Gly Ser Gly Trp 
100 105 110 

gga get att gat aat gcc gga gat cat get tgc caa tat aag caa gca 801 
Gly Ala He Asp Asn Ala Gly Asp His Ala Cys Gin Tyr Lys Gin Ala 
115 120 125 130 

aaa tta get cat cct atg agt aat ctt att acc atg cea age tec atg 849 
Lys Leu Ala His Pro Met Ser Asn Leu He Thr Met Pro Ser Ser Met 
135 140 145 

acg get gca tgt teg att atg tat gga eta cgt ggt tat caa aat acc 897 
Thr Ala Ala Cys Ser He Met Tyr Gly Leu Arg Gly Tyr Gin Asn Thr 
150 155 160 

gtt atg get gcc tgc gca acg ggc aca atg gcg ata ggc gat gee ttt 945 
Val Met Ala Ala Cys Ala Thr Gly Thr Met Ala He Gly Asp Ala Phe 
165 170 175 

gaa att att egc tea ggg egg gca aaa tgt atg att gcc gga gcc get 993 
Glu He He Arg Ser Gly Arg Ala Lys Cys Met He Ala Gly Ala Ala 
180 185 190 

gaa tea etc acg egg gaa tgt aat att tgg agt att gat gta ctg aat 1041 
Glu Ser Leu Thr Arg Glu Cys Asn He Trp Ser He Asp Val Leu Asn 
195 200 205 210 

gca tta teg aaa gaa caa gcg gac cea aat ctt gca tgt tgt cea ttt 1089 
Ala Leu Ser Lys Glu Gin Ala Asp Pro Asn Leu Ala Cys Cys Pro Phe 
215 220 225 

age ctt gat egc tct gga ttt gta tta gcc gaa gga gcg gcg gta gtt 1137 
Ser Leu Asp Arg Ser Gly Phe Val Leu Ala Glu Gly Ala Ala Val Val 
230 235 240 

tgt ctg gaa aat tat gat tea gcc ate gcg cgt ggt gca acg att tta 1185 
Cys Leu Glu Asn lyr Asp Ser Ala He Ala Arg Gly Ala Ttir He Leu 
245 250 255 

gcg gaa att aaa ggt tac gcc caa tat tea gat gcc gtt aat tta acc 1233 
Ala Glu He Lys Gly Tyr Ala Gin Tyr Ser Asp Ala Val Asn Leu Thr 
260 265 270 
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cgg cca aca gaa gat att gaa cct aaa ata tta gcg ata act aaa gcc 1281 
Arg Pro 'Thr Glu Asp lie Glu Pro Lys He Leu Ala He Thr Lys Ala 
275 280 285 290 

att gag cag gca cag att teg ccg aaa gat att gac tac att aat get 1329 
He Glu Gin Ala Gin He Ser Pro Lys Asp He Asp Tyr He Asn Ala 
295 300 305 

cat ggt act tct aca ccg tta aat gat ctt tat gaa act cag gca att 1377 
His Gly Thr Ser Thr Pro Leu Asn Asp Leu Tyr Glu Thr Gin Ala He 
310 315 320 

aaa gca gca ctg ggc caa tat get tat cag gta cct ata tea age aca 1425 
Lys Ala Ala Leaa Gly Gin Tyr Ala Tyr Gin Val Pro He Ser Ser Ihr 
325 330 335 

aaa tct tat ace ggc cac ctt att get gcc gcc ggt agt ttt gaa aeg 1473 
Lys Ser Tyr Thr Gly His Leu He Ala Ala Ala Gly Ser Phe Glu Thr 
340 345 350 



1521 



1569 



att gta tgt gtg aaa gca tta get gaa aat tgc ttg cca gca aca ttg 
He Val Cys Val Lys Ala Leu Ala Glu Asn Cys Leu Pro Ala Thr Leu 
355 360 365 370 

aat tta cac egg gcc gat cca gat tgc gat etc aat tat ttg cct aat 
Asn Leu His Arg Ala Asp Pro Asp Cys Asp Leu Asn Tyr Leu Pro Asn 
375 380 385 

caa cat tgc tac ace get caa cca gag gtg aca etc aat att age gca 1617 
Gin His cys Tyr Thr Ala Gin Pro Glu Val Thr Leu Asn He Ser Ala 
390 395 400 

ggt tte ggc ggg eat aac get gcg ttg gtt ate get aag gta agg taa 1665 
Gly Phe Gly Gly His Asn Ala Ala Leu Val He Ala Lys Val Arg 
405 410 415 

etgatatgtt gatttttgca atg gaa gat att gaa eat tgg teg aat tte tct 1718 

Met Glu Asp He Glu His Trp Ser Asn Phe Ser 
420 425 

ggg gat ttt aac cec ate cat tat teg gcg aaa age gag tct ttg egc 1766 
Gly Asp Phe Asn Pro He His Tyr Ser Ala Lys Ser Glu Ser Leu Ara 
430 435 440 445 

aat ata cag caa cac ccg gtg cag gga atg ttg agt ttg etc tat gta 1814 
Asn He Gin Gin His Pro Val Gin Gly Met Leu Ser Leu Leu Tyr Val 
450 455 460 

egg caa cag ttt tct caa tta act tee get ttt aca aeg gga ata ttg 1862 
Arg Gin Gin Phe Ser Gin Leu Thr Ser Ala Phe Thr Thr Gly He Leu 
465 470 475 

aac att gat gcc tct tte egc cag tat gtt tat ace gca tta cec cat 1910 
Asn He Asp Ala Ser Phe Arg Gin Tyr Val Tyr Thr Ala Leu Pro His 
480 485 490 

caa etg agg att aat act aaa aac aaa aeg ttt aaa tta gaa aat cec 1958 
Gin I^u Arg He Asn Thr Lys Asn Lys Thr Phe Lys Leu Glu Asn Pro 
495 500 505 

agt aaa gaa aac aeg ttg tte ggc aat acc age gta gag aat aca atg 2006 
Ser Lys Glu Asn Thr I^u Phe Gly Asn Thr Ser Val Glu Asn Thr Met 
510 515 520 525 

gag tea att gaa gat tgg ate gtt cag gat aat tgt caa aaa eta aeg 2054 
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Glu Ser lie Glu Asp Trp lie Veil Gin Asp Asn Cys Gin Lys Leu Itir 
530 535 540 

ata aca ggg gag gaa gtt tgt gaa aag tat get gtc ttt aga tac tat 2102 
lie Thr Gly Glu Glu Val Cys Glu Lys Tyr Ala Val Phe Arg Tyr Tyr 
545 550 555 

ttc cca agt gtc act tct att gga tgg ttc ctg gat gcg ctt get ttt 2150 
Phe Pro Ser Val Ttur Ser lie Gly Trp Phe Leu Asp Ala Leu Ala Phe 
560 565 570 

cat ctt att att aat teg aca gga ttt ctt aat ttt gag cac tac cat 2198 
His Leu lie He Asn Ser Thr Gly Phe Leu Asn Phe Glu His Tyr His 
575 580 585 

ttt aac caa tta cag gat tat ctg agt caa tct ttt act ttg cat act 2246 
Phe Asn Gin Leu Gin Asp Tyr Leu Ser Gin Ser Phe Thr Leu His Thr 
590 595 600 605 

ggg caa gcg att aaa ate agg aag gag att gtt aat agt aca gta tta 2294 
Gly Gin Ala He Lys He Arg Lys Glu He Val Asn Ser Thr Val Leu 
610 615 620 

tta tct tea ccg gat ate tgt gtt gaa tta aat cct cct tta ttg att 2342 
Leu Ser Ser Pro Asp He Cys Val Glu Leu Asn Pro Pro Leu Leu He 
625 630 635 

aag aat ggc gat aaa gat tat att egt att ttc tat tat cga tgt tta 2390 
Lys Asn Gly Asp Lys Asp Tyr He Arg He Phe Tyr Tyr Arg Cys Leu 
640 645 650 

tat gat aaa aaa cct att ttt gta tea aag act tea att ate tct aag 2438 
Tyr Asp Lys Lys Pro He Phe Val Ser Lys Thr Ser He He Ser Lys 
655 660 665 

atg aaa taa aaggaaagcg aaatgeeaac acaaagtgat atttteactg 2487 

Met Lys 

670 

aaataaagaa tagaatatta atgatgaagg atatagaaga tgaagaaata acaccagagt 2547 

eetettttgt ttcgettgaa tttgatagte ttgactatgt ggaaatceaa gtttttgtgt 2607 

tggaagcgta tggtattgtg ettaaagecg aaetttttte aaatcattet attteaacat 2667 

taaatgagct caetgaetat ttaaaatcaa aattgtaate tgaattttta ettaattatg 2727 

ttttttcacc attaaeatta agaggttata atg aac gtt tta gaa caa ggt aag 2781 

Met Asn Val Leu Glu Gin Gly Lys 
675 680 

gtt get get tta tat tea gee tat teg gaa aca gaa ggt tct teg tgg 2829 
Val Ala Ala Leu Tyr Ser Ala Tyr Ser Glu Thr Glu Gly Ser Ser Trp 
685 690 695 

gtg gga aac ttg tgc tgt ttt tea agt gat egg gag cat ttg cct att 2877 
Val Gly Asn I^u Cys Cys Phe Ser Ser Asp Arg Glu His Leu Pro He 
700 705 710 

ate gtg aat ggg cgt cgt ttc ttg att gaa ttt gtt att cca gat cat 2925 
He Val Asn Gly Arg Arg Phe Leu He Glu Phe Val He Pro Asp His 
715 720 725 

tta ctt gat aaa aeg gtt aaa cce aga gta ttc gat ttg gat ate aat 2973 
Leu Leu Asp Lys Thr Val Lys Pro Arg Val Phe Asp Leu Asp He Asn 
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730 735 740 

aaa caa ttt tta ctg cgt cgt gac cat cgt gag ata aat att tat ctt 3021 
Lys Gin Phe Leu Leu Arg Arg Asp His Arg Glu lie Asn lie Tyr Leu 
745 750 755 760 

tta ggt gaa gga aat ttt atg gat agg acg acg aca gat aaa aat eta 3069 
Leu Gly Glu Gly Asn Phe Met Asp Arg "Thr Thr Thr Asp Lys Asn Leu 
765 770 775 

ttc gag tta aat gag gat ggt tea eta ttt att aag acg tta cgc cat 3117 
Phe Glu Leu Asn Glu Asp Gly Ser Leu Phe lie Lys Thr Leu Arg His 
780 785 790 

get ett ggt aaa tat gtt get att aat ect tea act acg caa ttt ate 3165 
Ala Leu Gly Lys Tyr Val Ala lie Asn Pro Ser Thr Thr Gin Phe lie 
795 800 805 

ttc ttt gea caa gga aag tac agt gaa ttt ate atg aat gee tta aag 3213 
Phe Phe Ala Gin Gly Lys Tyr Ser Glu Phe He Met Asn Ala Leu Lys 
810 815 820 

aca gtt gaa gac gaa tta tea aaa cgt tat ega gtc aga att att cct 3261 
Thr Val Glu Asp Glu Leu Ser Lys Arg Tyr Arg Val Arg He He Pro 
825 830 835 840 

gaa ttg caa ggg ccg tat tat ggc ttt gaa ctt gat att ctt tct att 3309 
Glu Leu Gin Gly Pro Tyr Tyr Gly Phe Glu Leu Asp lie Leu Ser He 
845 850 855 

aca get taa ttcacaatat tatggagagt gtt atg gaa aag aaa ata aca aca 3362 
Thr Ala Met Glu Lys Lys He Thr Thr 

860 865 

ttt ace att gag aaa act gat gac aat ttt tat get aat ggg cgt cat 3410 
Phe Thr He Glu Lys Thr Asp Asp Asn Phe Tyr Ala Asn Gly Arg His 
870 875 880 

caa tgt atg gta aaa ate tct gta ett aaa caa gaa tat agg aat ggt 3458 
Gin Cys Met Val Lys He Ser Val Leu Lys Gin Glu Tyr Arg Asn Gly 
885 890 895 

gat tgg ata aaa tta gea ctt agt gag get gaa aaa aga teg att eag 3506 
Asp Trp He Lys Leu Ala Leu Ser Glu Ala Glu Lys Arg Ser He Gin 
900 905 910 

gtg gcg gea tta agt gat age etc ata tat gac caa tta atg cct 3554 
Val Ala Ala Leu Ser Asp Ser Leu He Tyr Asp Gin Leu Lys Met Pro 
915 920 925 930 

tea ggt tgg aca acg aca gat gea aga aat aaa ttt gat ctt ggg tta 3602 
Ser Gly Trp Thr Thr Thr Asp Ala Arg Asn Lys Phe Asp Leu Gly Leu 
935 940 945 

tta aat ggt gtt tat cat get gat get ttt att gac gaa eag gta aca 3650 
Leu Asn Gly Val Tyr His Ala Asp Ala Phe He Asp Glu Gin Val Thr 
950 955 960 

gat cgt gcg gga gat tgc tgc aca aat gaa aac tat eag aac agt gtg 3698 
Asp Arg Ala Gly Asp Cys Cys Thr Asn Glu Asn Tyr Gin Asn Ser Val 
965 970 975 

aaa agt gtt cct gaa att ate tat cgt tat gtc agt agt aat aga aca 3746 
Lys Ser Val Pro Glu He He Tyr Arg Tyr Val Ser Ser Asn Arg Thr 
980 985 990 
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age aca gaa tac eta atg gca aaa atg aca ttt gaa gat aeg gat ggg 3794 
Ser Thr Glu Tyr Leu Met Ala Lys Met Thr Phe Glu Asp Otir Asp Gly 
995 1000 1005 1010 

aaa cgc aea tta aea acg aat atg tea gtt ggt gat gaa gtt ttt gac 3842 
Lys Arg Thr Leu Thr Thr Asn Met Ser Val Gly Asp Glu Val Phe Asp 
1015 1020 1025 

age aag gtt tta tta aaa gee att get cct tat gca att aat aca aat 3890 
Ser Lys Val Leu Leu Lys Ala lie Ala Pro Tyr Ala lie Asn Thr Asn 
1030 1035 1040 

caa ttg cat gaa aac ate aat aca ttg ttt gat aaa aca gaa gag ccg 3938 
Gin Leu His Glu Asn lie Asn Thr Leu Phe Asp Lys Thr Glu Glu Pro 
1045 1050 1055 

aca aaa tec gat act cat cat caa ata att aat ctt tat cgc tgg aca 3986 
Thr Lys Ser Asp Thr His His Gin lie lie Asn Leu Tyr Arg Trp Thr 
1060 1065 1070 

ttg cca tat cat ttg agg att ctt gaa ggg aat gac agt act gtt aat 4034 
Leu Pro lyr His Leu Arg lie Leu Glu Gly Asn Asp Ser Thr Val Asn 
1075 1080 1085 1090 

aga ata tat gtc ctt ggt aaa gag cca tea aat gat aga ttc ctg aca 4082 
Arg lie Tyr Val Leu Gly Lys Glu Pro Ser Asn Asp Arg Phe Leu Thr 
1095 1100 1105 

aga gga agg gta ttt aaa cga gga act eat atg tga atgcacgtga 4128 
Arg Gly Arg Val Phe Lys Arg Gly Thr His Met 





1110 


1115 


taatgtgagt 


ggaggatgtg 


ttatggacta tgcttatacc gtaactatte cggacacgca 4188 


gcttgctgct 


gaagtgctte 


atgtgacagg gtgttcgtgg acgagtggtt attatgatgg 4248 


atatcatgat 


gtcacaatca 


ttgataacta cggttgtcag cataaattta gaatttctte 4308 


ggttaatatt 


ggacgtgcgc 


taagcatagc gagaataagt tgattttcct tagtaaaaaa 4368 


cccttgttta 


tgctggtaaa 


cgcatgtgcg tttgccagca attaatatat tccattattg 4428 


aaataggaat 


atagccatat 


ctgtaattat aeataaacga atttttactc gaatataatt 4488 


ttaattgatc 


aaacaggaaa 


tttaaa atg aaa get ace gat ata tat tec aat 4541 
Met Lys Ala Thr Asp lie Tyr Ser Asn 



1120 1125 

get ttt aat ttc ggt tet tat att aat act ggt gtc gat ece aga aea 4589 
Ala Phe Asn Phe Gly Ser Tyr lie Asn Thr Gly Val Asp Pro Arg Thr 
1130 1135 1140 

ggt caa tat agt gea aat att aat att ate aeg tta aga cet aat aat 4637 
Gly Gin Tyr Ser Ala Asn lie Asn He lie Thr Leu Arg Pro Asn Asn 
1145 1150 1155 

gtg ggt aat teg gaa caa aca ttg age eta tea ttc teg cca tta aca 4685 
Val Gly Asn Ser Glu Gin Thr Leu Ser Leu Ser E*ie Ser Pro Leu Thr 
1160 1165 1170 1175 

acg tta aac aat ggc ttt ggt att gge tgg cgc ttt tea tta aca aca 4733 
Thr Leu Asn Asn Gly Phe Gly He Gly Trp Arg Phe Ser Leu Thr Thr 
1180 1185 1190 
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tta gat ata aaa aca ctt aca ttt age cga gca aat ggg gag caa ttt 4781 
Leu Asp lie Lys Thr Leu Hhr Phe Ser Arg Ala Asn Gly Glu Gin PhB 
1195 1200 1205 

aa a tgt aag cca ttg ccg cct aat aat aat gat ctt agt ttt aaa gat 4829 
Lys Cys Lys Pro Leu Pro Pro Asn Asn Asn Asp Leu Ser Phe Lys Asp 
1210 1215 1220 

aaa aaa eta gat ttg cgc gta tat aag etc gat age aat act ttt 4877 
Lys Lys Leu Lys Asp Leu Arg Val Tyr Lys Leu Asp Ser Asn Ibr Phe 
1225 1230 1235 

tat gtt tat aac aaa aac ggc att ata gag ata ctt aaa cga att ggg 4925 
lyr Val Tyr Asn Lys Asn Gly lie lie Glu lie Leu Lys Arg lie Gly 
1240 1245 1250 1255 

teg agt gat att gca aaa aca gtt gca ctt gaa ttt cct gat ggt gaa 4973 
Ser Ser Asp lie Ala Lys Ihr Val Ala Leu Glu Phe Pro Asp Gly Glu 
1260 1265 1270 

gca ttt gat tta att tat aat tea aga ttt gca ttg tec gaa ata aaa 5021 
Ala Phe Asp Leu lie Tyr Asn Ser Arg Phe Ala Leu Ser Glu lie Lys 
1275 1280 1285 

tac cgt gtg aca ggt aaa act tat ett aaa etc aat tac tct gga aat 5069 
Tyr Arg Val Thr Gly Lys Thr Tyr Leu Lys Leu Asn Tyr Ser Gly Asn 
1290 1295 1300 

aac tgt aca tea gtg gaa tac cct gat gat aat aat att tct gcg aaa 5117 
Asn Cys Thr Ser Val Glu Tyr Pro Asp Asp Asn Asn lie Ser Ala Lys 
1305 1310 1315 

ata gca ttc gat tat cgt aac gat tac ctt att acg gtg act gta cct 5165 
lie Ala Phe Asp Tyr Arg Asn Asp Tyr Leu lie Thr Val Thr Val Pro 
1320 1325 1330 1335 

tac gat get tct ggt cct att gat tct gee cga ttt aag atg acc tat 5213 
Tyr Asp Ala Ser Gly Pro lie Asp Ser Ala Arg Phe Lys Met Thr Tyr 
1340 1345 1350 

eag aca tta aaa ggc gta ttt cca gtt ate age ace ttc cgt aca cca 5261 
Gin Thr Leu Lys Gly Val Phe Pro Val lie Ser Ihr Phe Arg Thr Pro 
1355 1360 1365 

acc ggt tat gtt gag ctg gtg agt tat aaa gag aat ggg cat aaa gtg 5309 
Thr Gly Tyr Val Glu Leu Val Ser Tyr Lys Glu Asn Gly His Lys Val 
1370 1375 1380 

acg gac acg gaa tat att cct tat gcg get gca etc act att caa ecc 5357 
Thr Asp Thr Glu Tyr lie Pro Tyr Ala Ala Ala Leu Thr lie Gin Pro 
1385 1390 1395 

ggc aat gga caa cct gcg gte age aaa tec tat gaa tat agt tea gta 5405 
Gly Asn Gly Gin Pro Ala Val Ser Lys Ser Tyr Glu Tyr Ser Ser Val 
1400 1405 1410 1415 

cat aac ttc ttg ggc tat tct tct ggc egg aca age ttt gat tec agt 5453 
His Asn Ehe Leu Gly Tyr Ser Ser Gly Arg Thr Ser Phe Asp Ser Ser 
1420 1425 1430 

Cr^^ gat aat ttg tat ttg gtc aca ggg aaa tac act tat tea tec att 5501 
Gin Asp Asn Leu Tyr Leu Val Thr Gly Lys Tyr Thr Tyr Ser Ser He 
1435 1440 1445 

gaa egg gtt tta gat ggt caa agt gtg gtt tea gta ata gaa cga gta 5549 
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Glu Arg Val Leu Asp Gly Gin Ser Val Val Ser Val lie Glu Arg Val 
1450 1455 1460 

ttt aat aaa ttc cat tta atg acc aaa gaa gca aaa aca caa gat aat 5597 
Phe Asn Lys Phe His Leu Met Thr Lys Glu Ala Lys Ihr Gin Asp Asn 
1465 1470 1475 

aag aga att aca aca gaa att act tac aat gag gat eta tea aaa agt 5645 
Lys Arg He "Thr Itir Glu He Thr Tyr Asn Glu Leu Ser Lys Ser 
1480 1485 1490 1495 

ttc tea gag eaa eca gaa aat tta caa caa cct tct cgc gtg tta acc 5693 
Phe Ser Glu Gin Pro Glu Asn Leu Gin Gin Pro Ser Arg Val Leu Ihr 
1500 1505 1510 

cgt tat acg gat ata caa aca aat act tea cga gaa gag act gtc aat 5741 
Arg Tyr Thr Asp He Gin Thr Asn Ihr Ser Arg Glu Glu Ihr Val Asn 
1515 1520 1525 

att aaa agt gat gat tgg gga aat act eta ctt att act gag ace agt 5789 
He Lys Ser Asp Asp Trp Gly Asn Thr Leu Leu He Ihr Glu Ihr Ser 
1530 1535 1540 

ggg ata cag aaa gaa tac gtt tat tat ccg gtc aat ggc gaa ggt aat 5837 
Gly He Gin Lys Glu Tyr Val Ty^^ Tyr Pro Val Asn Gly Glu Gly Asn 
1545 1550 1555 

agt tgc cct gee gat ccc ttg ggt ttt tct egg ttc tta aaa tea gtt 5885 
Ser Cys Pro Ala Asp Pro Leu Gly Phe Ser Arg Phe Leu Lys Ser Val 
1560 1565 1570 1575 

acg caa aaa gga teg cct gat get get caa agt gtc gca aat f^aa gtg 5933 
Thr Gin Lys Gly Ser Pro Asp Ala Ala Gin Ser Val Ala Asn Lys Val 
1580 1585 1590 

att cat tat aca tat caa aaa ttt cct act ttt acc ggc get tat gtt 5981 
He His Tyr Ihr Tyr Gin Lys Phe Pro Thr Phe Thr Gly Ala Tyr Val 
1595 1600 1605 

aag gaa tat gtc agt aaa gtc tea gag acg ata gae aat aaa ata gcg 6029 
Lys Glu Tyr Val Ser Lys Val Ser Glu Ihr He Asp Asn Lys He Ala 
1610 1615 1620 

aga acc ttt age tat gtt aac tea ccg acg agt aaa tct cat ggt teg 6077 
Arg Ihr Phe Ser Tyr Val Asn Ser Pro Ihr Ser Lys Ser His Gly Ser 
1625 1630 1635 

tta gca aaa ata acg tea gtg atg aat aac cag eaa acg gtc acc aca 6125 
Leu Ala Lys He Thr Ser Val Met Asn Asn Gin Gin Thr Val Thr Thr 
1640 1645 1650 1655 

ttt aaa tat gaa tat tea gaa agt gag atg acc aca aat get acg gtg 6173 
Phe Lys Tyr Glu Tyr Ser Glu Ser Glu Met Thr Thr Asn Ala Thr Val 
1660 1665 1670 

acc ggt ttt gat ggc gca cat atg gaa teg aaa aat gtg acg tct att 6221 
Thr Gly Phe Asp Gly Ala His Met Glu Ser Lys Asn Val Ihr Ser He 
1675 1680 1685 

tat ace cat egg caa ctt cgt aaa gtt gat gta aac cac gtg att acc 6269 
Tyr Thr His Arg Gin Leu Arg Lys Val Asp Val Asn His Val He Thr 
1690 1695 1700 

gat cag tct tat gat ctt ttg ggt cgc att aca ggg caa att att gat 6317 
Asp Gin Ser Tyr Asp Leu Leu Gly Arg He Thr Gly Gin He He Asp 
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1705 1710 1715 

ccc ggc acg gca aga gaa act aaa cgt aat tac gtt tat caa tat ccc 6365 
Pro Gly Thr Ala Arg Glu lie Lys Arg Asn Tyr Val TVr Gin Tyr Pro 
1720 1725 1730 1735 

ggc ggt gac gaa aat gat ttt tgg ccg gtg atg ata gaa gtt gat tct 6413 
Gly Gly Asp Glu Asn Asp Phe Trp Pro Val Met lie Glu Val Asp Ser 
1740 1745 1750 

caa ggc gtc aga cgt aaa acc cat tac gat gga atg gga cgt att tgt 6461 
Gin Gly Val Arg Arg Lys Tta* His Tyr Asp Gly Met Gly Arg lie Cys 
1755 1760 1765 

teg att gaa gaa caa gat gat gat ggc gcc tgg ggc aca teg ggg att 6509 
Ser lie Glu Glu Gin Asp Asp Asp Gly Ala Trp Gly Tte Ser Gly lie 
1770 1775 1780 

tat caa ggc aca tat cga aaa gtt ctt gcc aga caa tat gat gtt ttg 6557 
Tyr Gin Gly Thr Tyr Arg Lys Val Leu Ala Arg Gin Tyr Asp Val Leu 
1785 1790 1795 

ggg cag ttg age aag gaa att tea aat gat tgg tta tgg aat tta tct 6605 
Gly Gin Leu Ser Lys Glu lie Ser Asn Asp Trp Leu Trp Asn Leu Ser 
1800 1805 1810 1815 

gcc aat cct ttg gtt cgt ctt get aec ceg ttg gtt aca acg aaa acc 6653 
Ala Asn Pro Leu Val Arg Leu Ala Thr Pro Leu Val Thr Thr Lys Thr 
1820 1825 1830 

tat aaa tat gat ggt tgg gga aat ctt tac age acg gaa tac agt gat 6701 
Tyr Lys Tyr Asp Gly Trp Gly Asn Leu Tyr Ser Thr Glu Tyr Ser Asp 
1835 1840 1845 

ggt egg ata gag ctg gaa ate eat gat cet att acg agg aca att act 6749 
Gly Arg lie Glu Leu Glu lie His Asp Pro lie Thr Arg Thr lie Thr 
1850 1855 1860 

caa ggg gtc aaa gga tta ggg atg tta aat att eag caa aat aat ttt 6797 
Gin Gly Val Lys Gly Leu Gly Met Leu Asn lie Gin Gin Asn Asn Phe 
1865 1870 1875 

gag caa ccg get teg ate aaa get gtg tat cct gat ggt acg ata tat 6845 
Glu Gin Pro Ala Ser lie Lys Ala Val Tyr Pro Asp Gly Thr lie Tyr 
1880 1885 1890 1895 

age acc cgt act tat cgt tat gat gga ttt ggt cgt aca gtg acg gaa 6893 
Ser Thr Arg Thr Tyr Arg Tyr Asp Gly I*ie Gly Arg Thr Val Thr Glu 
1900 1905 1910 

aca gat gca gaa ggt cat get acc caa att gga tat gat gtg ttt gat 6941 
Thr Asp Ala Glu Gly His Ala Thr Gin lie Gly Tyr Asp Val Phe Asp 
1915 1920 1925 

cgt ata gtg aaa aaa acg ttg cca gac gga aca ata tta gaa tec get 6989 
Arg lie Val Lys Lys Thr Leu Pro Asp Gly Thr lie Leu Glu Ser Ala 
1930 1935 1940 

tat gca age ttt age cat gaa gaa tta att teg gca ctg aac gtg aat 7037 
Tyr Ala Ser Phe Ser His Glu Glu Leu lie Ser Ala Leu Asn Val Asn 
1945 1950 1955 

ggc aca eag ttg ggg gea tta gtt tat gat ggt ctt ggg egg gta ata 7085 
Gly Thr Gin Leu Gly Ala Leu Val Tyr Asp Gly Leu Gly Arg Val He 
1960 1965 1970 1975 
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agt gat acg gtg ggt ggt cgc aaa acg gaa tat tta tat ggg cct caa 
Ser Asp Thr Val Gly Gly Arg Lys Ihr Glu Tyr Leu Tyr Gly E>ro Gin 
1980 1985 1990 



7133 



ggt gac aaa ccg att cag tea att act cct teg cat aat aag caa aat 
Gly Asp Lys Pro lie Gin Ser lie Ihr Pro Ser His Asn Lys Gin Asn 
1995 2000 2005 



7181 



atg gat tac etc tac tat ctt ggt agt gtg atg tec aaa ttt ace acg 
Met Asp Tyr Leu Tyr Tyr Leu Gly Ser Val Met Ser Lys Phe Thr Ttir 
2010 2015 2020 



7229 



ggg aca gac caa caa aac ttt cgt tat cat teg aaa acg gga aca tta 
Gly Thr Asp Gin Gin Asn Phe Arg Tyr His Ser Lys Thr Gly Thr Leu 
2025 2030 2035 



7277 



tta tct gcg tea gaa ggc gta tet cag act aat tac agt tat ttc cca 
Leu Ser Ala Ser Glu Gly Val Ser Gin Thr Asn Tyr Ser Tyr Phe Pro 
2040 2045 2050 2055 



7325 



teg ggt gta tta cag ega gaa tea ttt tta egg gat aat aaa ccg att 
Ser Gly Val Leu Gin Arg Glu Ser Phe Leu Arg Asp Asn Lys Pro lie 
2060 2065 2070 



7373 



tea teg ggc gag tac ctt tat acg atg tec ggt ttg att caa cgt cat 
Ser Ser Gly Glu Tyr Leu Tyr Thr Met Ser Gly Leu lie Gin Arg His 
2075 2080 2085 



7421 



aaa gat agt ttt ggt cat aat cat gtt tat agt tac gat get cag gga 
Lys Asp Ser Phe Gly His Asn His Val Tyr Ser Tyr Asp Ala Gin Gly 
2090 2095 2100 



7469 



aga ttg gtc aaa aca gaa cag gat gca caa tac get aca ttt gaa tat 
Arg Leu Val Lys Thr Glu Gin Asp Ala Gin Tyr Ala Thr Phe Glu Tyr 
2105 2110 2115 



7517 



gac aat gtt ggg ega ttg ata aca acg acg ace aaa gac acg acg tea 
Asp Asn Val Gly Arg Leu lie Thr Thr Thr Thr Lys Asp Thr Thr Ser 
2120 2125 2130 2135 



7565 



tta tee caa tta gtg aca aaa ate gaa tat gat get ttt gat ega gaa 
Leu Ser Gin Leu Val Thr Lys lie Glu Tyr Asp Ala Phe Asp Arg Glu 
2140 2145 2150 



7613 



ata aaa cgc teg eta att agt gac ttc tea ata caa gtt att acc tta 
He Lys Arg Ser Leu He Ser Asp Phe Ser He Gin Val He Thr Leu 
2155 2160 2165 



7661 



age tat acg aag aat aat caa ate agt caa cgt ate acc tec ate gat 
Ser Tyr Thr Lys Asn Asn Gin He Ser Gin Arg He Thr Ser He Asp 
2170 2175 2180 



7709 



ggg gtg gtt atg aaa aat gaa cgt tat caa tat gat aat aat caa cgc 
Gly Val Val Met Lys Asn Glu Arg Tyr Gin Tyr Asp Asn Asn Gin Arg 
2185 2190 2195 



7757 



tta age caa tac caa tgt gag gga gaa caa tct ccg att gat cat acg 
Leu Ser Gin Tyr Gin cys Glu Gly Glu Gin Ser Pro He Asp His Thr 
2200 2205 2210 2215 



7805 



ggt cgt gta tta aat cag cag att tac cat tat gac caa tgg gga aat 
Gly Arg Val Leu Asn Gin Gin He Tyr His Tyr Asp Gin Trp Gly Asn 
2220 2225 2230 



7853 
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att aag egg etc gat aat aca tat cga gat ggt aag gaa acg gtg gat 7901 
lie Lys Arg Leu Asp Asn Thr Tyr Arg Asp Gly Lys Glu Thr Val Asp 
2235 2240 2245 

tat cat ttc agt caa gcc gat cca act caa ctt att cgt att acc age 7949 
lyr His Phe Ser Gin Ala Asp Pro Ihr Gin Leu lie Arg lie Thr Ser 
2250 2255 2260 

gac aaa cag cag ata gag tta agt tat gat get aat ggc aac eta aca 7997 
Asp Lys Gin Gin lie Glu Leu Ser Tyr Asp Ala Asn Gly Asn Leu Ihr 
2265 2270 2275 

cgt gac gaa aaa ggg caa acg etc att tac gat cag aat aat egc ttg 8045 
Arg Asp Glu Lys Gly Gin Thr Leu lie Tyr Asp Gin Asn Asn Arg Leu 
2280 2285 2290 2295 

gta cag gtc aaa gac egg ttg ggc aat etg gtg tgc age tac cag tat 8093 
Val Gin Val Lys Asp Arg Leu Gly Asn Leu Val Cys Ser Tyr Gin Tyr 
2300 2305 2310 

gat gea ttg aac aaa tta ace gea cag gtt ttg gcg aat ggt ace gtt 8141 
Asp Ala Leu Asn Lys Leu Thr Ala Gin Val Leu Ala Asn Gly Ihr Val 
2315 2320 2325 

aat cga cag cat tat get tee ggt aaa gtg acg aat att caa ttg ggt 8189 
Asn Arg Gin His Tyr Ala Ser Gly Lys Val Ihr Asn lie Gin Leu Gly 
2330 2335 2340 

gat gaa gcg att act tgg ttg age agt gat aag caa cga att gga cat 8237 
Asp Glu Ala lie Ihr Trp Uesa Ser Ser Asp Lys Gin Arg lie Gly His 
2345 2350 2355 

caa age gcc aag aat ggt caa tea gtc tac tat caa tat ggt att gac 8285 
Gin Ser Ala Lys Asn Gly Gin Ser Val Tyr Tyr Gin Tyr Gly lie Asp 
2360 2365 2370 2375 

eat aac agt acg gtt ate gcc agt cag aac gaa aac gag ttg atg get 8333 
His Asn Ser Thr Val lie Ala Ser Gin Asn Glu Asn Glu Leu Met Ala 
2380 2385 2390 

tta tee tat aca cet tat ggc ttt agg agt tta att tec tea tta ccg 8381 
Leu Ser Tyr Thr Pro Tyr Gly Phe Arg Ser Leu lie Ser Ser Leu Pro 
2395 2400 2405 

ggt ttg aat ggc gea cag gtt gat cca gta aca ggc tgg tac ttc tta 8429 
Gly Leu Asn Gly Ala Gin Val Asp Pro Val Thr Gly Trp Tyr Hie Leu 
2410 2415 2420 

ggt aac gga tat cgt gtt ttc aac ccg gtt etc atg agg ttt cac age 8477 
Gly Asn Gly Tyr Arg Val Phe Asn Pro Val Leu Met Arg Phe His Ser 
2425 2430 2435 

cec gat agt tgg agt cet ttt ggt egg gga ggg att aac cet tat acc 8525 
Pro Asp Ser Trp Ser Pro Phe Gly Arg Gly Gly lie Asn Pro Tyr Thr 
2440 2445 2450 2455 

tat tgc caa ggc gat cec ata aac egg att gat ctg aac ggt cat ctt 8573 
Tyr Cys Gin Gly Asp Pro lie Asn Arg lie Asp Leu Asn Gly His Leu 
2460 2465 2470 

agt gcc ggc ggg ata tta ggc att gtg eta ggg gea att ggc ate att 8621 
Ser Ala Gly Gly He Leu Gly He Val Leu Gly Ala He Gly He He 
2475 2480 2485 

gtc ggg att gta tea ctg gga gcc gga gcg gcg att age gcg ggt etc 8669 
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Val Gly lie Val Ser hen Gly Ala Gly Ala Ala He Ser Ala Gly Leu 
2490 2495 2500 

att get gcg ggg ggc get ttg ggg gcg att get tct acc age gcg ctt 8717 
He Ala Ala Gly Gly Ala lieu Gly Ala He Ala Ser Thr Ser Ala Leu 
2505 2510 2515 

gca gtt act gcg act gtc att gga ttg get gcc gat teg ata ggg att 8765 
Ala Val Thr Ala Thr Val He Gly Leu Ala Ala Asp Ser He Gly He 
2520 2525 2530 2535 

gcg tea gca gca tta teg gaa aaa gat ceg aaa aea tct ggg ata tta 8813 
Ala Ser Ala Ala Lieu Ser Glu Lys Asp Pro Lys Ihr Ser Gly He Leu 
2540 2545 2550 

aat tgg att agt gcg gga ttg ggg gtt tta age ttt ggt ate age gea 8861 
Asn Trp He Ser Ala Gly Leu Gly Val Leu Ser Phe Gly He Ser Ala 
2555 2560 2565 

ata acc ttt ace tet teg etg gta aaa teg gca egg agt ggt tct eag 8909 
He Thr Phe Thr Ser Ser Leu Val Lys Ser Ala Arg Ser Gly Ser Gin 
2570 2575 2580 

gca gtc age gcg ggt gtt ate ggg tea gtg cct ctt gaa ttt ggt gaa 8957 
Ala Val Ser Ala Gly Val He Gly Ser Val Pro Leu Glu Phe Gly Glu 
2585 2590 2595 

gtt get age cgt tec age aga ega tgg gat att gcg tta tct teg ata 9005 
Val Ala Ser Arg Ser Ser Arg Arg Trp Asp He Ala Leu Ser Ser He 
2600 2605 2610 2615 

teg ttg ggc gca aat gcg gcg tet etc tet aeg ggg ata gcg gcg gcg 9053 
Ser Leu Gly Ala Asn Ala Ala Ser Leu Ser Thr Gly He Ala Ala Ala 
2620 2625 2630 

gcg gtt gca gac agt aat gcg aat gca get aat att ctg gga tgg gta 9101 
Ala Val Ala Asp Ser Asn Ala Asn Ala Ala Asn He Leu Gly Trp Val 
2635 2640 2645 

tec ttt ggt ttt ggt gca gta teg aea ace tea gga ata att gag ctt 9149 
Ser Phe Gly Phe Gly Ala Val Ser Thr Thr Ser Gly He He Glu Leu 
2650 2655 2660 

aog cgt aca get tat gca gtg aat cat cag act tgg gaa ctg agt tea 9197 
Thr Arg Thr Ala lyr Ala Val Asn His Gin Ihr Trp Glu Leu Ser Ser 
2665 2670 2675 

tea gea ggt act teg gag gaa gtg aag ect ata cgt tgt etc gtt tea 9245 
Ser Ala Gly Thr Ser Glu Glu Val Lys Pro He Arg Cys Leu Val Ser 
2680 2685 2690 2695 

cae cgc tgg aat eag aag eag tga atgttaaccc tcctcgggca gttgagttaa 9299 
His Arg Trp Asn Gin Lys Gin 
2700 

tcaaacgttt cgaaatagta cegggaacta tttagccaat cgtceattga aacccgtaat 9359 
gtgttgcgac gtcgtttgae aatataaaga ttctgcgaac cgattggtta agtctcacga 9419 
aaaataacta ttaggcgaca tttgcgtcge cttttttaag gaaetttatc aggttacatt 9479 
tataagaagc tattttgttt tegaeggatg ttggtttetc tgagataaaa aatagaggga 9539 
aatgatgtca agggtgataa tggttaattg taaaatatgt gatattattc gcatttatat 9599 
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gtcaatgtaa ttcctcttat tatttaattt tattgcattt gctacgcgaa atcgccttat 9659 
aattttattt ttaataaatt attatttcat cattaaacta aaataaatta tttctaga 9717 



<210> 2 
<211> 417 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 2 

Met Gin Arg Ala Gin Arg Val Val He Thr Gly Met Gly Ala Val Thr 
15 10 15 

Pro He Gly Glu Asp Val Glu Ser Cys Trp Gin Ser lie lie Glu Lys 
20 25 30 

Gin His Arg Phe His Arg He Glu Phe Pro Asp Ser Phe He Asn Ser 
35 40 45 

Arg Phe Phe Ser Phe Ijeu Ala Pro Asn Pro Ser Arg Tyr Gin Leu Leu 
50 55 60 

Pro Lys Lys Leu Thr His Ihr Leu Ser Asp Cys Gly Lys Ala Ala Leu 
65 70 75 80 

Lys Ala Thr Tyr Gin Ala Phe Thr Gin Ala Phe Gly Val Asn He Ser 
85 90 95 

Pro VclL Glu Tyr Tyr Asp Lys Tyr Glu Cys Gly Val He Leu Gly Ser 
100 105 110 

Gly Trp Gly Ala He Asp Asn Ala Gly Asp His Ala Cys Gin lyr Lys 
115 120 125 

Gin Ala Lys Leu Ala His Pro Met Ser Asn Leu He Ibr Met Pro Ser 
130 135 140 

Ser Met Thr Ala Ala Cys Ser He Met T/r Gly Leu Arg Gly Tyr Gin 
145 150 155 160 

Asn Thr Val Met Ala Ala Cys Ala Thr Gly Thr Met Ala He Gly Asp 
165 170 175 

Ala Phe Glu He He Arg Ser Gly Arg Ala Lys Cys Met He Ala Gly 
180 185 190 

Ala Ala Glu Ser Leu Thr Arg Glu Cys Asn He Trp Ser He Asp Val 
195 200 205 

Leu Asn Ala Leu Ser Lys Glu Gin Ala Asp Pro Asn Leu Ala Cys Cys 
210 215 220 

Pro Phe Ser Leu Asp Arg Ser Gly Phe Val Leu Ala Glu Gly Ala Ala 
225 230 235 240 

Val Val Cys Leu Glu Asn Tyr Asp Ser Ala He Ala Arg Gly Ala Thr 
245 250 255 

He Leu Ala Glu He Lys Gly Tyr Ala Gin lyr Ser Asp Ala Val Asn 
260 265 270 

Leu Thr Arg Pro Thr Glu Asp He Glu Pro Lys He Leu Ala He Ihr 
275 280 285 
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Lys Ala He Glu Gin Ala Gin He Ser Pro Lys Asp He Asp lyr He 
290 295 300 

Asn Ala His Gly Thr Ser Thr Pro Leu Asn Asp lisu Tyr Glu Ihr Gin 
305 310 315 320 

Ala He Lys Ala Ala Leu Gly Gin Tyr Ala Tyr Gin Val Pro He Ser 
325 330 335 

Ser Ttir Lys Ser Tyr Thr Gly His Leu He Ala Ala Ala Gly Ser Etie 
340 345 350 

Glu Thr He Val Cys Val Lys Ala Leu Ala Glu Asn Cys Leu Pro Ala 
355 360 365 

Thr Leu Asn Leu His Arg Ala Asp Pro Asp Cys Asp Leu Asn Tyr Leu 
370 375 380 

Pro Asn Gin His Cys Tyr Thr Ala Gin Pro Glu Val Thr Leu Asn He 
385 390 395 400 

Ser Ala Gly Phe Gly Gly His Asn Ala Ala Leu Val He Ala Lys Val 
405 410 415 



Arg 



<210> 3 
<211> 253 
<212> POT 

<213> Photorhabdos luminescens 

<400> 3 

Met Glu Asp He Glu His Trp Ser Asn Phe Ser Gly Asp Phe Asn Pro 
15 10 15 

He His Tyr Ser Ala Lys Ser Glu Ser Leu Arg Asn He Gin Gin His 
20 25 30 

Pro Val Gin Gly Met Leu Ser Leu Leu Tyr Val Arg Gin Gin Phe Ser 
35 40 45 

Gin Leu Thr Ser Ala Phe Ite Ttir Gly He Leu Asn He Asp Ala Ser 
50 55 60 

Phe Arg Gin Tyr Val Tyr Thr Ala Leu Pro His Gin Leu Arg He Asn 
65 70 75 80 

Thr Lys Asn Lys Thr Phe Lys Leu Glu Asn Pro Ser Lys Glu Asn Thr 
85 90 95 

Leu Phe Gly Asn Ihr Ser Val Glu Asn Thr Met Glu Ser He Glu Asp 
100 105 110 

Trp He Val Gin Asp Asn Cys Gin Lys Leu Thr He Thr Gly Glu Glu 
115 120 125 

Val Cys Glu Lys Tyr Ala Val Phe Arg Tyr Tyr Ete Pro Ser Val Thr 
130 135 140 

Ser He Gly Trp Phe Leu Asp Ala Leu Ala Phe His Leu He He Asn 
145 150 155 160 

Ser Thr Gly Phe Leu Asn Phe Glu His Tyr His Phe Asn Gin Leu Gin 
165 170 175 
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Asp lyr Leu Ser Gin Ser Phe Ihr Leu His Tfar Gly Gin Ala lie Lys 
180 185 190 

lie Arg Lys Glu lie Val Asn Ser Ihr Val Leu Leu Ser Ser Pro Asp 
195 200 205 

lie Cys Val Glu Leu Asn Pro Pro Leu Ijeu lie Lys Asn Gly Asp Lys 
210 215 220 

Asp Tyr lie Arg He Phe Tyr Tyr Arg Cys Leu Tyr Asp Lys Lys Pro 
225 230 235 240 

He Phe Val Ser Lys Ihr Ser He He Ser Lys Met Lys 
245 250 



<210> 4 
<211> 186 
<212> PRT 

<213> Photorhabdus lurainescens 
<400> 4 

Met Asn Val Leu Glu Gin Gly Lys Val Ala Ala Leu lyr Ser Ala Tyr 
15 10 15 

Ser Glu Ihr Glu Gly Ser Ser Trp Val Gly Asn Leu Q/s Cys Phe Ser 

20 25 30 

Ser Asp Arg Glu His Leu Pro He He Val Asn Gly Arg Arg Phe Leu 
35 40 45 

He Glu Phe Val He Pro Asp His Leu Leu Asp Lys Ihr Val Lys Pro 
50 55 60 

Arg Val Phe Asp Leu Asp He Asn Lys Gin Phe Leu Leu Arg Arg Asp 
65 70 75 80 

His Arg Glu He Asn He Tyr Leu Leu Gly Glu Gly Asn Phe Met Asp 
85 90 95 

Arg Ihr Thr Thr Asp Lys Asn Leu Phe Glu Leu Asn Glu Asp Gly Ser 
100 105 no 

I^u Phe He Lys ihr Leu Arg His Ala Leu Gly Lys Tyr Val Ala He 
115 120 125 

Asn Pro Ser Ihr Thr Gin Phe He Phe Phe Ala Gin Gly Lys Tyr Ser 
130 135 140 

Glu Phe He Met Asn Ala Leu Lys Thr Val Glu Asp Glu Leu Ser Lys 
145 150 155 160 

Arg Tyr Arg Val Arg He He Pro Glu Leu Gin Gly Pro Tyr Tyr Gly 
165 170 175 

Phe Glu Leu Asp He Leu Ser He Thr Ala 
180 185 



<210> 5 
<211> 258 
<212> PRT 

<213> Photorhabdus luminescens 
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<400> 5 

Met Glu Lys Lys lie Ttir 'Thr E*ie Thr He Glu Lys Thr Asp 
1 5 10 

Asp Asn Phe Tyr Ala Asn Gly Arg His Gin Cys Met Val Lys He Ser 
15 20 25 30 

Val Leu Lys Gin Glu Tyr Arg Asn Gly Asp Trp He Lys Ala Leu 
35 40 45 

Ser Glu Ala Glu Lys Arg Ser He Gin Val Ala Ala Leu Ser Asp Ser 
50 55 60 

Leu He Tyr Asp Gin Leu Lys Met Pro Ser Gly Trp ite TInr Ihr Asp 
65 70 75 

Ala Arg Asn Lys Phe Asfp Leu Gly Leu Leu Asn Gly Val Tyr His Ala 
80 85 90 

Asp Ala Phe He Asp Glu Gin Val Ihr Asp Arg Ala Gly Asp Cys Cys 
95 100 105 110 

'Hhr Asn Glu Asn Tyr Gin Asn Ser Val Lys Ser Val Pro Glu He He 
115 120 125 

Tvr Arg Tyr Val Ser Ser Asn Arg Thr Ser Thr Glu Tyr Leu Met Ala 
130 135 140 

Lys Met Thr Glu Asp Thr Asp Gly Lys Arg Thr Leu Thr Thr Asn 
145 150 155 

Met Ser Val Gly Asp Glu Val Phe Asp Ser Lys Val Leu Leu Lys Ala 
160 165 170 

He Ala Pro Tyr Ala He Asn Ihr Asn Gin Leu His Glu Asn He Asn 
175 180 185 190 

Thr Leu Phe Asp Lys Thr Glu Glu Pro Thr Lys Ser Asp Thr His His 
195 200 205 

Gin He He Asn Leu Tyr Arg Trp Thr Leu Pro Tyr His Arg He 
210 215 220 

Leu Glu Gly Asn Asp Ser Thr Val Asn Arg He Tyr Val Leu Gly Lys 
225 230 235 

Glu Pro Ser Asn Asp Arg Phe Leu Thr Arg Gly Arg Val Phe Lys Arg 
240 245 250 

Gly Thr His Met 
255 



<210> 6 
<211> 1584 
<212> PRT 

<213> Photorhabdus luminescens 

<400> 6 

Met Lys Ala Ihr Asp He Tyr Ser Asn Ala Phe Asn Phe Gly Ser Tyr 
15 10 15 

He Asn Thr Gly Val Asp Pro Arg Ihr Gly Gin Tyr Ser Ala Asn He 
20 25 30 
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Asn lie He Thr Lieu Arg Pro Asn Asn Val Gly Asn Ser Glu Gin Thr 
35 40 45 

Leu Ser Leu Ser Phe Ser Pro Leu Tlir Thr Leu Asn Asn Gly Phe Gly 

50 55 60 

lie Gly Trp Arg Phe Ser Leu Ihr Thr Leu Asp He Lys Thr Leu Thr 
65 70 75 80 

Phe Ser Arg Ala Asn Gly Glu Gin Phe Lys cys Lys Pro Leu Pro Pro 
85 90 95 

Asn Asn Asn Asp Leu Ser Phe Lys Asp Lys Lys Leu Lys Asp Leu Arg 
100 105 110 

Val Tyr Lys Leu Asp Ser Asn Ihr Phe Tyr Val Tyr Asn Lys Asn Gly 
115 120 125 

He He Glu He Leu Lys Arg He Gly Ser Ser Asp He Ala Lys Thr 
130 135 140 

Val Ala Leu Glu Phe Pro Asp Gly Glu Ala Phe Asp Leu He Tyr Asn 
145 150 155 160 

Ser Arg Phe Ala Leu Ser Glu He Lys Tyr Arg Val Thr Gly Lys Ihr 
165 170 175 

lyr Leu Lys Leu Asn Tyr Ser Gly Asn Asn Cys Ito Ser Val Glu Tyr 
180 185 190 

Pro Asp Asp Asn Asn He Ser Ala Lys He Ala Phe Asp Tyr Arg Asn 
195 200 205 

Asp Tyr Leu He Thr Val Thr Val Pro Tyr Asp Ala Ser Gly Pro He 
210 215 220 

Asp Ser Ala Arg Phe Lys Met Ihr Tyr Gin Ttar Leu Lys Gly Val Phe 
225 230 235 240 

Pro Val He Ser Thr Ite Arg Thr Pro Thr Gly Tyr Val Glu Leu Val 
245 250 255 

Ser Tyr Lys Glu Asn Gly His Lys Val Thr Asp Ihr Glu Tyr He Pro 
260 265 270 

Tyr Ala Ala Ala Leu Thr He Gin Pro Gly Asn Gly Gin Pro Ala Val 

275 280 285 

Ser Lys Ser Tyr Glu Tyr Ser Ser Val His Asn Phe Leu Gly Tyr Ser 
290 295 300 

Ser Gly Arg Thr Ser Phe Asp Ser Ser Gin Asp Asn Leu Tyr Leu Val 
305 310 315 320 

Ihr Gly Lys Tyr Thr Tyr Ser Ser He Glu Arg Val Leu Asp Gly Gin 
325 330 335 

Ser Val Val Ser Val He Glu Arg Val Phe Asn Lys Phe His Leu Met 
340 345 350 

Ihr Lys Glu Ala Lys Thr Gin Asp Asn Lys Arg He Thr Ihr Glu He 
355 360 365 

Thr Tyr Asn Glu Asp Leu Ser Lys Ser Phe Ser Glu Gin Pro Glu Asn 
370 375 380 



BNSDOCID: <WO 99425e9A2_l_> 



wo 99/42589 PCT/EP99/01015 

- 18- 



Leu Gin Gin Pro Ser Arg Val Leu Thr Arg Tyr Thr Asp lie Gin Thr 
385 390 395 400 

Asn Ihr Ser Arg Glu Glu Thr Val Asn lie Lys Ser Asp Asp Trp Gly 
405 410 415 

Asn Itir Leu Leu lie Thr Glu Thr Ser Gly lie Gin Lys Glu Tyr Val 
420 425 430 

Tyr lyr Pro Val Asn Gly Glu Gly Asn Ser Cys Pro Ala Asp Pro Leu 
435 440 445 

Gly Phe Ser Arg Phe Leu Lys Ser Val Thr Gin Lys Gly Ser Pro Asp 
450 455 460 

Ala Ala Gin Ser Val Ala Asn Lys Val lie His Tyr Thr Tyr Gin Lys 
465 470 475 480 

Phe Pro Thr Phe Thr Gly Ala Tyr Val Lys Glu Tyr Val Ser Lys Val 
485 490 495 

Ser Glu Thr lie Asp Asn Lys He Ala Arg Thr Phe Ser Tyr Val Asn 
500 505 510 

Ser Pro Thr Ser Lys Ser His Gly Ser Leu Ala Lys He Thr Ser Val 
515 520 525 

Met Asn Asn Gin Gin Thr Val Thr Thr Phe Lys Tyr Glu Tyr Ser Glu 
530 535 540 

Ser Glu Met Thr Thr Asn Ala Thr Val Thr Gly Phe Asp Gly Ala His 
545 550 555 560 

Met Glu Ser Lys Asn Val Thr Ser lie Tyr Thr His Arg Gin Leu Arg 
565 570 575 

Lys Val Asp Val Asn His Val He Thr Asp Gin Ser Tyr Asp Leu Leu 
580 585 590 

Gly Arg He Thr Gly Gin He He Asp Pro Gly Thr Ala Arg Glu He 
595 600 605 

Lys Arg Asn Tyr Val Tyr Gin Tyr E>ro Gly Gly Asp Glu Asn Asp Phe 
610 615 620 

Trp Pro Val Met He Glu Val Asp Ser Gin Gly Val Arg Arg Lys Thr 
625 630 635 640 

His Tyr Asp Gly Met Gly Arg He Cys Ser He Glu Glu Gin Asp Asp 
645 650 655 

Asp Gly Ala Trp Gly Thr Ser Gly He Tyr Gin Gly Thr Tyr Arg Lys 
660 665 670 

Val Leu Ala Arg Gin Tyr Asp Val Leu Gly Gin Leu Ser Lys Glu He 

675 680 685 

Ser Asn Asp Trp Leu Trp Asn Leu Ser Ala Asn Pro Leu Val Arg Leu 
690 695 700 

Ala Thr Pro Leu Val Thr Thr Lys Thr Tyr Lys Tyr Asp Gly Trp Gly 
705 710 715 720 

Asn Leu Tyr Ser Thr Glu Tyr Ser Asp Gly Arg He Glu Leu Glu He 
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725 



730 



735 



His Asp Pro lie Thx Arg ihr lie Thr Gin Gly Val Lys Gly Leu Gly 
740 745 750 

Met Leu Asn lie Gin Gin Asn Asn Phe Glu Gin Pro Ala Ser lie Lys 
755 760 765 

Ala Val Tyr Pro Asp Gly Thr lie Tyr Ser Thr Arg Thr Tyr Arg Tyr 
770 775 780 

Asp Gly Rie Gly Arg Thr Val Thr Glu Thr Asp Ala Glu Gly His Ala 
785 790 795 800 

Thr Gin lie Gly Tyr Asp Val Phe Asp Arg lie Val Lys Lys Thr Leu 
805 810 815 

Pro Asp Gly Thr lie Leu Glu Ser Ala Tyr Ala Ser Phe Ser His Glu 
820 825 830 

Glu Leu lie Ser Ala Leu Asn Val Asn Gly Thr Gin Leu Gly Ala Leu 
835 840 845 

Val TVr Asp Gly Leu Gly Arg Val lie Ser Asp Hir Val Gly Gly Arg 
850 855 860 

Lys Thr Glu Tyr Leu lyr Gly Pro Gin Gly Asp Lys Pro lie Gin Ser 
865 870 875 880 

lie Thr Pro Ser His Asn Lys Gin Asn Met Asp Tyr Leu Tyr Tyr Leu 
885 890 895 

Gly Ser Val Met Ser Lys Phe Ihr Thr Gly Thr Asp Gin Gin Asn Phe 
900 905 910 

Arg Tyr His Ser Lys Thr Gly Thr Leu Leu Ser Ala Ser Glu Gly Val 
915 920 925 

Ser Gin Thr Asn Tyr Ser Tyr Phe E>ro Ser Gly Val Leu Gin Arg Glu 
930 935 940 

Ser Phe Leu Arg Asp Asn Lys Pro lie Ser Ser Gly Glu Tyr Leu Tyr 
945 950 955 960 

Thr Met Ser Gly Leu lie Gin Arg His Lys Asp Ser Kie Gly His Asn 
965 970 975 

His Val Tyr Ser Tyr Asp Ala Gin Gly Arg Leu Val Lys Thr Glu Gin 
980 985 990 

Asp Ala Gin Tyr Ala Thr Phe Glu Tyr Asp Asn Val Gly Arg Leu He 
995 1000 1005 

Thr Thr Thr Thr Lys Asp Thr Thr Ser Leu Ser Gin Leu Val Thr Lys 
1010 1015 1020 

He Glu Tyr Asp Ala Phe Asp Arg Glu He Lys Arg Ser Leu He Ser 
025 1030 1035 1040 

Asp Phe Ser He Gin Val He Thr Leu Ser Tyr Thr Lys Asn Asn Gin 
1045 1050 1055 

He Ser Gin Arg He Thr Ser He Asp Gly Val Val Met Lys Asn Glu 



1060 



1065 



1070 
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Arg Tyr Gin Tyr Asp Asn Asn Gin Arg Leu Ser Gin Tyr Gin Cys Glu 
1075 1080 1085 

Gly Glu Gin Ser Pro lie Asp His Thr Gly Arg Val Leu Asn Gin Gin 
1090 1095 1100 

lie lyr His Tyr Asp Gin Trp Gly Asn lie Lys Arg Leu Asp Asn Ihr 
105 1110 1115 1120 

Tyr Arg Asp Gly Lys Glu Thr Val Asp Tyr His Phe Ser Gin Ala Asp 
1125 1130 1135 

Pro Thr Gin Leu lie Arg lie Thr Ser Asp Lys Gin Gin lie Glu Leu 
1140 1145 1150 

Ser Tyr Asp Ala Asn Gly Asn Leu Thr Arg Asp Glu Lys Gly Gin Ttir 
1155 1160 1165 

Leu lie Tyr Asp Gin Asn Asn Arg Leu Val Gin Val Lys Asp Arg Leu 
1170 1175 1180 

Gly Asn Leu Val Cys Ser Tyr Gin Tyr Asp Ala Leu Asn Lys Leu Thr 
185 1190 1195 1200 

Ala Gin Val Leu Ala Asn Gly Thr Val Asn Arg Gin His Tyr Ala Ser 
1205 1210 1215 

Gly Lys Val Thr Asn lie Gin Leu Gly Asp Glu Ala lie Thr Trp Leu 
1220 1225 1230 

Ser Ser Asp Lys Gin Arg lie Gly His Gin Ser Ala Lys Asn Gly Gin 
1235 1240 1245 

Ser Val Tyr Tyr Gin Tyr Gly lie Asp His Asn Ser T!hr Val lie Ala 
1250 1255 1260 

Ser Gin Asn Glu Asn Glu Leu Met Ala Leu Ser Tyr Thr Pro Tyr Gly 
265 1270 1275 1280 

Phe Arg Ser Leu lie Ser Ser Leu Pro Gly Leu Asn Gly Ala Gin Val 
1285 1290 1295 

Asp Pro Val Hir Gly Trp Tyr Phe Leu Gly Asn Gly Tyr Arg Val E*ie 
1300 1305 1310 

Asn Pro Val Leu Met Arg Phe His Ser Pro Asp Ser Trp Ser Pro ftie 
1315 1320 1325 

Gly Arg Gly Gly lie Asn Pro Tyr Thr Tyr Cys Gin Gly Asp Pro lie 
1330 1335 1340 

Asn Arg lie Asp Leu Asn Gly His Leu Ser Ala Gly Gly lie Leu Gly 
345 1350 1355 1360 

He Val Leu Gly Ala He Gly He He Val Gly He Val Ser Leu Gly 
1365 1370 1375 

Ala Gly Ala Ala He Ser Ala Gly Leu He Ala Ala Gly Gly Ala Leu 
1380 1385 1390 

Gly Ala He Ala Ser Thr Ser Ala Leu Ala Val Thr Ala Thr Val He 
1395 1400 1405 

Gly Leu Ala Ala Asp Ser He Gly He Ala Ser Ala Ala Leu Ser Glu 
1410 1415 1420 
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Lys Asp Pro Lys Thr Ser Gly He Leu Asn Trp He Ser Ala Gly Leu 
425 1430 1435 1440 

Gly Val Leu Ser i*ie Gly He Ser Ala He Ihr Phe Thr Ser Ser Leu 
1445 1450 1455 

Val Lys Ser Ala Arg Ser Gly Ser Gin Ala Val Ser Ala Gly Val He 
1460 1465 1470 

Gly Ser Val Pro Leu Glu Phe Gly Glu Val Ala Ser Arg Ser Ser Arg 
1475 1480 1485 

Arg Trp Asp He Ala Leu Ser Ser He Ser Leu Gly Ala Asn Ala Ala 
1490 1495 1500 

Ser Leu Ser Thr Gly He Ala Ala Ala Ala Val Ala Asp Ser Asn Ala 
505 1510 1515 1520 

Asn Ala Ala Asn He Leu Gly Trp Val Ser Phe Gly Ete Gly Ala Val 
1525 1530 1535 

Ser Thr Thr Ser Gly He He Glu Leu Thr Arg Thr Ala Tyr Ala Val 
1540 1545 1550 

Asn His Gin Thr Trp Glu Leu Ser Ser Ser Ala Gly Thr Ser Glu Glu 
1555 1560 1565 

Val Lys Pro He Arg Cys Leu Val Ser His Arg Trp Asn Gin Lys Gin 
1570 1575 1580 



<210> 7 
<211> 18 
<212> mk 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.oligonucleotide 
<400> 7 

acacagcagg ttcgtcag 18 



<210> 8 

<211> 18 
<212> ENA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : oligonucleotide 
<400> 8 

ggcagaagca ctcaactc 18 



<210> 9 
<211> 20 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 9 
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attgatagca cgcggcgacc 



<210> 10 
<211> 22 
<212> nsiA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 10 

ttgtaacgtg gagccgaact gg 



<210> 11 
<211> 37948 
<212> CNA 

<213> Photortiabdus luminescens 

<220> 
<221> CDS 

<222> (15171) . . (18035) 
<223> orfS 

<220> 
<221> CDS 

<222> (23768) . . (31336) 
<223> hph2 

<220> 
<221> CDS 

<222> (31393) . . (35838) 
<223> orf2 

<400> 11 

tgttgctgga ccgtggagat tatgcctatc gtcagttaga acgagacacg ctcaatgaag 60 
ccaagatgtg gtatatgcaa gcactgcatc tgttaggcga taaacctcat ctatcgttca 120 
gttcagagtg gagcaaaccg agtttaggcg acgctgccgg aacggaaaga caaaagcaac 180 
acgcccaagc aatggccgct ctgcgacaag gtgatgttag tcggcacaac aacccgacag 240 
atcttttctt gccacaggtc aatgaagtga tgcaaaacta ttggcaaaaa ttggaacaac 300 
ggctgtataa cctgcgtcat aacctcacta ttgacggcca accgctacat ctgcctattt 360 
acgctacacc ggcagatcca aaagcattac ttagcgccgc tgtcgctagc tccgaaggtg 420 
gggtagctct ctcacagcca tttatgtcac tgtggcgttt cccacacatg ctggaaaacg 480 
cgcgtggtat ggtcagtcag ctcactcaat tcggctctac gctacaaaat attatcgaac 540 
gtcaggatgc ggaagcttta aacacgctct tgcagaatca agcagcagaa ctgatattga 600 
ctcatctcag catacaggac aaaaccatcg cagagctgga tgcggaaaaa atcgtactgg 660 
aaaaatccaa agccggggcg caatcacgct ttgacagcta caaaaagtta tacgacgaaa 720 
atatcaatgc gggtgaaaac cgggctatag cattgcatgc ctccgttgct ggcctcagca 780 
ctgccctgca agcatcacgt ctggcgggcg ctgcgcttga tctggcgccc aacattttcg 840 
gtctcgctga tggcggtagc cgttggggag cgattgccga agcgacaggt aatgttatgg 900 
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aattctcx:gc 


cagtgtgatg 


aacaccgaag 


gccgtcgccg 


tcaggaatgg 


gaaatccagc 


tcgatgctca 


acttcaatca 


ctggcagtac 


gccr^aaaac 


ccaacaggaa 


cagactcatg 


gtaatcaagc 


gttgtactac 


tggctacgcg 


acgatttggc 


cgtagcgcgt 


tgtctgatgg 


agaccgcggc 


aagctttatc 


aaacccggcg 


ccggt.gaaac 


cttgatgctg 


aatctggcgc 


aacgcgctct 


ggaagtggaa 


cggaccattt 


aaaaatogtt 


taatttagcc 


acacggattt 


ccattgatga 


tcatcccgtt 


acattagaaa 


caggtctgtc 


attagataat 


gactacccag 


ttaaacaaat 


cagtgtcacc 


ttgccagccc 


ttctgtccta 


cggaggaagt 


gaaatcggat 


ctcatgggat 


caatgacagt 


ggtcaattcc 


cgtttgaagg 


gattgcgatt 


aacgatactg 


actgtcaaac 


aagaaaacat 


gttgcagact 


accatccgcc 


aataaccacc 


tcaattaaat 


acttttcacg 


aatatatacc 


actcacagtc 


aacagagata 


tttccttttc 


tcgtgagtcc 


ctttttcata 


atatgcaata 


tgtaacaaat 


gccauactac 


ttatgatgta 


attataaatt 


aafl r.aaf aac 


gttgtgaaat 


agttgatagt 


coacwcaaiiu 


ticagaccgct. 


atcagacgtt 


qacQoccL.ua 


c ccacaccca 


cggcactcag 


fr* A As^vrv^ 

L- uririoagyaa 


tiaaacat^gaa 


cacactcaaa 


tttaataatc 


taaccgatat 


ttgtcatctc 


gacaaactct 


catggtcaca 


gacccaaagc 


gacaatctcc 


tgcatgaagc 


ccatattct^ 


gtccatcttg 


ccctgacaac 


acctcatgct 


aatcgcgcca 


gcaaatatac 


agccccaggc 


tatttagctg 


aactttatcg 


tcaggcacgg 



cggataaaat cagccagtxa gaagcctatc 960 
gtaatcatgc ogaagcagag ataaaacaga 1020 
gccgtgaagc cgcggtattg cagaaagcca 1080 
ctcaattgac tttcctgcaa cgtaaattca 1140 
gtcggctagc tgctatttac ttccaatttt 1200 
ctgaaatggc ttatcgttgg gagactaatg 1260 
cctggcaggg aacccatgcg ggtttactgg 1320 
aaatggaaga tgcccatttg aggtgggatc 1380 
cattgacgca acactatgga gcactgccag 1440 
ctaccctgct agcaggtggt acaactgact 1500 
acgaccaact tagtgcxaaa atctctctgt 1560 
atggcaacgg cgtaggcaac attcgacgca 1620 
tgttaggacc atatcaggat gtacaagcta 1680 
tagctgaaag ctgtaaatca ctggcgatct 1740 
agttggattt taacaatggt aagttcctgc 1800 
gcacattgac actcagtttc ccccaatgcg 1860 
ttgagtgata ttattctgca tattcgctat 1920 
accaaaaaca ggctcctaaa cggggcctga 1980 
tgctctcttt acctgtctga cgctcgttat 2040 
catcacctac tataaaatat caaccctctt 2100 
gcaattattt catttagtta ttgttaacta 2160 
ttgttattgc atcacaatag ccatttaaat 2220 
taaatggtgt ttttatttag ccgttatttt 2280 
acctgtgttg cctttgtttt gatagatata 2340 
accagcataa atgttttatt tacctaacat 2400 
tccgaatatg aaaacgcgtt agtagcaggt 2460 
tcttttgacg aacttogcaa aaaagtgaag 2520 
ttatatcttg aagcacagca agtgcaaaag 2580 
aaacgcgcca atcctcattt acaaagtgcg 2640 
gaccagcaag gttataatag cagatttggc 2700 
gcaatttctt ccatgttttc tcctgcggct 2760 
aatttacatg atgaaaattc tatttatcat 2820 
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ttggatacac ggcgtccgga tctaaaatca ttggtgctca gccagaaaaa tatggatacg 2880 
gagatttcca cactttctct gtctaatgac atgttgctag agggtattaa aactctgttc 2940 
aaggacaagc tgctggaggc tctgaagaat attaaatctc tgtccaagga cgagctgctg 3000 
gaggctttga agaatattaa acctctgtcc aaggacgatc tgctggaggc tttgaagaat 3060 
attaaacctc tgtccaagga cgatctgctg gaggctttga agaatattaa acctctgtcc 3120 
aaggacgatc tgctggaggc tttgaagaat attaaacctc tgtccaagga cgatctgctg 3180 
gaggctttga agaatattaa acctctgtcc aaggacgatc tgctggaggc tttgaagaat 3240 
attaaacctc tgtccaagga cgatctgctg gaggctttga agaatattaa acctctgtcc 3300 
aaggacgatc tacaggaatg tattgaaatt ctattcaatc tggacagcca cactaaagta 3360 
atgaaagcgt tatccaattt ccgcgtttct ggcatgatgc catatcacga tgcttatgaa 3420 
agcgtgcgta aggttgttca attacaggct ccggtgtttg aacacgttgt tagtacatca 3480 
ctagaaacga ctatcgatga actaaaatat caagcttctt tgttggaaat taattcttct 3540 
gtctcgccta aattatttac tatcttgact gaagaaatta ctacaatcaa tgcaagaagt 3600 
ctctatgagg aaaattttgg taatattaaa ccttctctaa taggaaaacc ggaatatctg 3660 
aaaagttatt acaatctgag tgatgaagag tttagcgatt tcattaaaat aagaactata 3720 
cttcttccag aagaagaaat agcaattact gatcttgcat cgcgtactac tagtacacaa 3780 
cagactatcg aaaatcctga ttatcgtgct ctattgaaaa ttaataagtt tattcgtcta 3840 
ttcaaagcta taaacttatc accgacggta ttaagtggaa tcctccgcag catcagcaca 3900 
gaattcaata tcaataaaga aatattacaa aaaatctttc gtgttaaata ctatatgcaa 3960 
cgttatggta ttgacactga gactgcatta atactatgca aggtaccaat ttcacaatat 4020 
atcaatgacg gacatctaag tcagtttgat cgtttattta attcccccaa actgaatggt 4080 
caagattttt ccgtcaatgg tactcagaat attgatttaa ccctaagcag taccaacaac 4140 
tggaataaaa cagtacttaa acgtgctttt aacctcgatg atatctcatt aaatcgacta 4200 
ctaaaaatta ccaatccggt caatactacc gaaatgataa ctaatgatat agagaatctt 4260 
tctcatctct ataggacaaa attactggca gatatccatc agttaactat tgatgaactg 4320 
gggttactgt tggaagccat aggtaaagga acaaccaatt tatctgagat tactcctgac 4380 
aatctggtta ctctaattaa caaactctat gctgtcacta gctggctacg tacacaaaag 4440 
tggagtgtct atcagttgtt tatgatgact actgataaat ataacaaaac cctaaccccg 4500 
gaaatacgga atttactgga taccgtctac aatggcttgc aagattttaa taagaagatg 4560 
ttgaaagctg aagaagatct agagaaaacc aaaaagaaat tgcagagcgc caaggaaaat 4620 
ctggaaaaat tcccggaaaa ccagccacaa ctccaagaag acaggaaaaa agcccagaga 4680 
agactgaata aagctgaaga gacccacgaa aaagccgaga aaaacctaga tgaggtcagg 4740 
aaaaatctgc caaaagccat atctccttat atcgccgccg ctctgcaatt accatctgaa 4800 
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catgcggcat attccatact catctgggca gataatctgg aacccjggcat aggaaaaatg 4860 
acagcggaaa aattatggaa ctggttgcgg aaaaatcccg ttacggctca acctgaattc 4920 
caaaaacaag ctgaacctgt ggtccagtat tgccagcgcc tggcacaact agcgttgatt 4980 
taccgttcta ccggccttaa cgaaaacacc ttaagtctgt ttgtgacaaa gccgcaacac 5040 
tttgttatta aaaccaaagc acccgaaaca actgaaacaa caccagcaca tgacgtatca 5100 
acactaatgt cactaacgcg ttttactgac tgggttaact cactaggtga aaaogcctct 5160 
tctgtactaa ccgaatttaa aaaaggaaca ttaacagcag aactattggc taaggctatg 5220 
aatcttgata aaaatctact ggagcaagcc aagattcagg caaaaactga tttgtccaac 5280 
tggccatcta tcgacaacct attgcagtgg attaacatct cacgtcaatt gaacatctct 5340 
ccacaaggca tttccacact gactcaagta ttgaccgcag aacctcccgc taactatacc 5400 
caatgggaaa acgccgctgc gatattaacc gccgggctgg acacccaaaa gactaacgcc 5460 
ctacatgcgt ttctggatga gtctcgcagt gctgcgttaa gcacatacta tatttattct 5520 
cataaccaaa aagatcgaga agcaagaaaa catacagtaa ttaaaaaccg tgatgatcta 5580 
tatcaatacc tattgatcga t^ccaagtt tccgccgaca ttaaaactac agagatcgct 5640 
gaagctatcg ctagtatcca actgtatatt aaccgcgcgt tgaaaaatat ggagggagat 5700 
actgtcacaa gtgtcaccag ccgctcattc ttcaccaact gggataaata caataaacgc 5760 
tacagcactt gggccggtat ggctaaactc ctttactatc cagagaatta catcgatccg 5820 
acgctacgta ttgggcagac aaaaatgatg gatacgttgc tgcaatccat cagccaaagc 5880 
caattaaata tcgataccgt agaagatgcc tttaaatctt acctaacatc attcgaacag 5940 
gtggctaatc tggaaatcct: cagcgcctac catgacaaca ttaataatga tcaaggatta 6000 
acttacttta tcggacgtag taaaacagaa gtgaatcaat attattggcg cagtgtggat 6060 
cacaataaat ccagcgaagg taaattcccc gctaatgcct ggagtgagtg gtacaaaatt 6120 
gattgtccaa ttaaccccta caaagatact attcacccgg taattttcca atctcgcctg 6180 
tatcttatxt ggctggaaca aaaaaaggcg actaaacagg aaggtgataa aaccgcctcg 6240 
ggttattatt atgaactgaa attagcgcat atccgttatg acggcacctg gaatacacca 6300 
gtcacctttg atgtaaacca aaaaatatcc gatttaaatc tgggaaataa aacacctgga 6360 
ctttactgct caagctttca aggcagagat gaattgctgg tgatgtttta taaaaaacaa 6420 
gatcaattaa atcaatacac aaacacagta ccaataaaag gactatatat cacttccaat 6480 
atgtcttcta aggaaatgac acctgaaaat caraaaccta acgcttataa acagtttgat 6540 
actaatagta ttattggtgt caataatcgc tatgcagaaa gctaogaaat cccttcatca 6600 
gtaaatagta ataacggtta tgattgggga gatggctatc tgagtatggt gtatggcgga 6660 
aatatttcag ccatcaaact. ggagtcctca tcagataagt taaaactctc accaaggtta 6720 
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agaattattc ataatggact tgtaggccga caacgcaacc aatgcaacct gatgaagaaa 6780 
tacggtcagc ttggtgataa atttattatt tatactactc taggtattaa ccccaataat 6840 
ttgtcgaata aaaaattcat ttacx:ctgtt tatcagtata gtgggaacac taccaataat 6900 
gagaaaggac gtctgctgtt ttatcgagaa agt^ctacta actttgtaag agcctggttc 6960 
cctaaccttc cctctggctc tcaagaaatg tccacaacca ctggcggtga cattagtggt 7020 
aactatggtt atattgataa caaacatagt gacgatgttc catttaaaca atatttctat 7080 
atggatgacc acggtggtat tgacactgat gtttcaggga tiattatctat taatacgaac 7140 
attaatcatt caaaagttaa agtaatagtg aaagccgaag gtatcacaga gcaaactttt 7200 
gtagcgagcg aaaacagtaa tgtccccacc aatccgtccc gcttcgaaga aatgaattat 7260 
cagtttaaag agcttgaaat agatatctcc acactgacat ttcacaataa tgaagcaagt 7320 
attgatatca cctttatcgc atttgctgag aaatttgacg ataatagtaa tgatcgtaac 7380 
ttaggcgaag aacatttcag tattcgtatt atcaaaaaag cggaaactga taatgccccg 7440 
accctgcacc ataatgcaaa cggggcgcaa tatatgcagt ggggaaactc ttgtattcgc 7500 
cttaatacgc tatttgcccg tcaattaatt agccgagcca acgcggggat agatactatt 7560 
ttgagcatgg acactcagaa tattcaggaa cctaaattag gagaagattc tcctgatgct 7620 
atggaaccaa tggacttcaa cggcgccaac agcctctatt tctgggaact gttctactac 7680 
accccgatgc tgattgctca acgtttgctg cacgaacaaa acttcgatga ggctaaccgt 7740 
tggctgaaat atgtctggaa cccatccggt tatattgtca atggtcaaac gcaacattac 7800 
cgctggaatg ttcgcccatt acaagaagac actagttgga acgatgatcc gttggattca 7860 
tttgatcctg ataccatagc tcaacatgat ccaatgcact acaaagtcgc cacctttatg 7920 
cgcaccctag atctgttgat cgaacgggga gattacgcct atcgccaatt ggagcgggac 7980 
acactcgctg aagccaaaat gtggtatatg caggcactgc atctattggg tgataaacct 8040 
catctaccac tcagttcagc atggaatgat ccagagctag aagaggccgc agctcttgaa 8100 
aaacaacagg cacatgccaa agaaatagca gatttacgac aaggacttcc tacatccaca 8160 
gggtctaaag atgaaatcaa aacagatctt ttcctgccgc aagtcaacga agtgatgctg 8220 
agctactggc agaaactaga acaacggttg tat^cctgc gccataacct ctctattgat 8280 
ggccaacctt tacatttgcc tattttcgca acaccagcag atccaaaagc gctgctcagc 8340 
gccgctgtcg ccagttcaca aggtggaagt aatcttccat cagaatttat atcagtgtgg 8400 
cgtttccctc atatgctgga aaacgcccgt agtatggtca gtcagctaac ccaattcggc 8460 
tccacattgc aaaatattat cgaacgtcaa gatgcggagg cattaaacac gctgttgcaa 8520 
aatcaggcgg cagaactgat attgaccaat ctcagcatac aggacaaaac catccaagag 8580 
ctggatgctg aaaaaactgt gctagaaaaa aaccgcgccg gaacccagtc gcgttttgat 8640 
agctacagca aattctacga tgaagacatc aacgcgggng aaaaacaggc aatggcgttg 8700 
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cgtgcttccg tcgctggcat: ctctacagcc cttcaagcat cacatctggc gggcgcagca 8760 
cttgatctgg cgcccaacat cttcggcttc gctgatggtg gcagccgttg gggggcgatc 8820 
gcc c aagcca caggtaatgt catggagttc tccgccagtg ttatgaacac cgaagcggat 8880 
aaaatcagcc aatctgaagc ctaccgtcgg cgtcgtcagg aatgggaaat tcagcgtaat 8940 
aacgccgagg cagagctgaa acaaatcgat gctcaacttg gttcgctggc agtgcgccgt 9000 
gaagccgcag tattgcagaa aaccagccta aaaacccaac aagagcagac tcatgcacaa 9060 
ccgaccttcc tgcaacataa gttcagtaat caggcgctgt acaactggct gcgtggtcga 9120 
ttgtccgcca tttacttcca gttctatgat ttaacggtag ctcgctgttt gatggcggaa 9180 
atggcctatc gctgggagac taacgatacc gcatcacgct ttatcaaacc cggcgcctgg 9240 
cagggaaccc atgcoggttt gctcgcgggt gaaaccttaa tgctgaatct ggcacagatg 9300 
gaagatgccc acctgaaaca ggataaacgc gtactggagg tagaacgtac cgtttcgctg 9360 
gcogaagtct atgccaaatt accgcaagat aaatttatcc tgactcagga aatagagaag 9420 
ttggtgagta aaggttcagg cagggccggc aaggacaata ataagctggc gtttagtacc 9480 
aataccaata cctctctaga agcgtccatt tcgttatcta ccttgaacat tagcagcgat 9540 
tatcctgatt ctattggtaa aacccgtcgt attaaacaga tcagcgttac cctgccagca 9600 
ctgctaggac cctatcagga tgtgcaagca attctgtctt acagcggaaa agcctctgaa 9660 
ttggctgaaa gttgcaaatc attagcggtt tctcatggga tgaatgacag cggtcagttc 9720 
caactggatt tcaacgatgg caaattcctg ccgttcgaag gaatcaaaat cgatgaaggt 9780 
acgctgacat tgagcttccc aaatgc3att agtaaagaag acaaaaaaga cgaaaaaggc 9840 
aaacaacaag ccatgctgga gagtctgaac gacatcattc tgcatattcg ctacaccatt 9900 
cgc r aataac gattttaatt aagtgctaaa acaggcccxrt aagcggggcc tgcaaggagt 9960 
ctttcatgca aaattcacaa gatttcagta ttacagaact atcattgcx:c aaaggaggag 10020 
gcgctatcac gggaatgggg gaagctttaa cccccaccgg gccggatggg atggccgcgc 10080 
tgtctctgcc gttgcctatc tctgccgggc gcggttatgc tccgtcactc gccttaaact 10140 
acaacagcgg cgccggtaac agcccatttg gtctgggctg ggattgcaac gttatgacca 10200 
tccgccgccg cacccatttt ggcgttccac attatgatga aaccgatacc tttctggggc 10260 
cagatggcga ggtactggtg gtagcggatc aatcccgcga cgaatcgaca ttacagggta 10320 
tcaacttagg caccgccttt accgttaccg gataccgttc ccgtctggag agtcatttca 10380 
gccgattgga atattggcaa cccaaggcaa cacccaagac aactggcaaa acagattttt 10440 
ggctgatata tagcccagat ggacaagtac atttactggg taaatcacca caagcccgga 10500 
tcagcaaccc gtcagacatc actcaaacag cacaatggtt gctagaagcc tctgtgtcac 10560 
cacatggtga acaaatttat tatcaatatc gggccgagga taacaccggt tgcgaagctg 10620 
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atgaaattac 


tctccatcca 


caggccgcog 


gcaaccggac 


agccagcaaa 


acgttacccg 


actggttatt 


ctatctggta 


tttgattacg 


cagcattttc 


gactacaggt 


agctggcttt 


atggtitttga 


gattcgtacc 


cgccgcttat 


aagctctgga 


tagcgagata 


aaagaacaca 


tcaattatga 


cgaaagcgca 


atcgccagca 


agcaagacgg 


tactgccgtc 


accctgccgc 


cgcaacataa 


cgctcgctgg 


caatcgatga 


gctggcaact 


agttgatcta 


aaaggcgaag 


gcgcctggtg 


gtaccgctcc 


gcacaacgtt 


gggaaaaaat 


gcaacx:tttg 


tcggttatcc 


atatcaatgg 


agacggccaa 


cttgactggg 


atagtcagca 


tccagatggc 


agttggacac 


aatatactca 


tccacgcgcg 


caactcgccg 


tactgatcgg 


ccctaagagt 


gtacgtttat 


gaaaagatgt 


agtgcaatcc 


ggtgatatca 


agttggtggc 


atttagtjgat 


gtattgggtt 


cgactaaagt 


cacctgctgg 


cctaatctgg 


ttccgggatt 


tagccaacca 


gaagcgacgt 


tagatggcag 


cggcccgact 


gatctgattt 


tgaataaaag 


cggcaaoggc 


ttcgccgcac 


gttttgatca 


tacctgtcag 


ttacaagtgg 


tgatattaag 


tgtgccgcat 


atgactcccc 


aaccgtggtt 


actcagtgaa 


atgaacaaca 


gtacrttccgc 


cc:3gttctgg 


ctggatgaaa 


cagtttgtta 


tctacccttc 


ccggtacaca 








agcgagaatt 


tcgcggattt 


ggttatgttg 


gcagtgcgcc 


agaatgcaca 


ccacctgcac 


cacccgcgct 


gacccaaggc 


aacgctccag 


acgcgcctga 


actcacatca 


cccgcgctga 


ccgcgctggc 


ccaaggcaat 


gcgccagaac 
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cgcaacgt.ta tctacacaca gtgtattacg 10680 
gtctggatgg cagcgcccca ccacaagcag 10740 
gcgaacgcag taacaacctg agaacgccgc 10800 
gtcgccagga ccgtttttcc cgttatgaat 10860 
gccgtcaggt attgatgtat caccacctgc 10920 
acggaccaac gctggtttca cgcctgatac 10980 
cgctggtatt cgttcgtcga gtaggccacg 11040 
cattagaatt ggcgtatcag gatttttcac 11100 
atgtgctggc aaacttcaat gccattcagc 11160 
gattcxsrcgg tctgctatat caagataaag 11220 
ttggcaaaat tggctcagat gccgtcactt 11280 
cttccttgca aagtaatgcc tcgccggtgg 11340 
ttatcaccgg accgggatta cggggatatc 11400 
gttttacccc gctcaacgct ctgccagtgg 11460 
atttaatggg agct^^actt tctgatttag 11520 
atgccaatac ccgcgacggc tttgccaaag 11580 
cactgccagt accgggcgcc gatccgtgta 11640 
ccggtcaggc acatctggtt gaagtgagcg 11700 
ggcacggacg ttttggtcaa ccaattactc 11760 
ttaatcctgc tcaagtttat ctggccgatc 11820 
atgttcacac agatcgtctg gatatcttcc 11880 
cagtaactct ccccttccca gccggagtgc 11940 
ccgatgtaca agggttaggc gtcgccagcc 12000 
atcactggcg ttgcgatctg accaacacaa 12060 
atatgggggc tcatcacacc ctgcgntacc 12120 
aagccacggc actggatgcc ggacaaatac 12180 
ccctatggca aacggaaata gaggatgaaa 12240 
gttatgcaca tggcgcctgg gatggacgtg 12300 
aacagaaaga cagccatcaa ctggcccaag 12360 
tgacccaagg caacgcgcct gaactcacat 12420 
aactcacacc acctgcgatg acccaaagca 12480 
cccaaggcaa cgcgccagaa ttcacatcac 12540 
tcacaccacc tgcgatgacc aaaaactggt 12600 
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atgccaccgg aatacccatg atagataaca cattatcgac agagtattgg catggtgatc 12660 
accaagcttt tgccggtttt tcaccacgct ttacgacctg gcaagatggt caagatattc 12720 
tgctcacacc ggaaaatgat aacagtcagt actggctaaa ccgggcactg aaaggtcaac 12780 
tgctacgcag tgaactgtac ggcgaggatg gcagtacaca ggaaaaaatt ccctacacag 12840 
tcactgaatt tcgcccacag gtacgtcggt tacagcatac cgatagcx::ga taccttgtgc 12900 
tttggtcatc tgtagttgaa agccgcaact atcattacga acgtatcgcc agcgatcctc 12960 
aatgcagcca aaagattacg ctatccagog atctatttgg tcaacogcta aaacaggttt 13020 
cggtacagta tccacgccgc cagcaaccgg caagcagtcc gtatxrctgat acgttgcctg 13080 
ataagttatt tgctaacagc tatgatgacc agcaacacaa attacggctc acctatcaac 13140 
agttcagttg gcatcatctg accgacaata ccattctgat gttaggatta ccggatagta 13200 
cccgcagcga tatctttgct tatagcgctg aacatgtccc tactggtggt ctaaatctgg 13260 
aaatcctaaa tgataaaaat agtctgattg cggagaataa acctcgtgaa tacctoggcc 13320 
agcaaaaaac cgtttatacc gacgggcaaa atgcaacgcc atcgcaaacg ccaacacgac 13380 
aagcgctgat tgccttcacc gagacaacag tatttaatca atccacacta tcagcgtttg 13440 
atgggagtat ctcatctgct caattgtcaa cgacgctigga acaagccgga taccagcaaa 13500 
cagattatct attcccgcgc actggagaag ataaagtctg ggcagctcgt cgtggctata 13560 
ctgattacgg cacagccgaa cagttctggc ggccgcaaaa acagagcaac actcaactca 13620 
cgggcaaaat cacgctcact tgggatgcaa actattgcgt cgtcacacaa acccgggatg 13680 
cggctggact gacaacctca gccagatatg attggcgttt tctgaccccc gttcaactca 13740 
cggatatcaa cgacaatcag caccttacca cgctggatgc actgggccga ccaatcacac 13800 
tgcgcttttg gggaaccgaa aacggtaaga tgactggtta ttcttcaccg gaaaaaatat 13860 
cgttttctcc accatctgat gttgacgccg cgattaagtt aacaacgcca atccctgtag 13920 
cacagtgtca ggtctacgca cccgaaagct ggatgcccat attaaagaaa accctcaata 13980 
acctggcaga gcaagagcgg aaagagttat ataacacccg aatcatcacc gaagacggac 14040 
gcatctgtac cctagctcac cgccgctggg taaaaagcca aagtgcagtc acccagccaa 14100 
tcaatctgtc aaacggcagt cccx:gtttac cccx:tcatag cctcacattg actacggatc 14160 
gttatgaccg cgatcttaag caacagattc gtcaacaagt agtattcagt gatggctttg 14220 
gccgtttact gcaagcatct gtacgacatg aagcaggcga agcctggcaa cgtaaccaag 14280 
acggcgctct ggtgacaaaa atggaagata ccaaaacgcg ctgggcggtt acgggacgca 14340 
ctgaatatga caataaggga caaccgatac gcacctatca accctatttc ctcaacgact 14400 
ggcaatacgt cagtaatgac agtgcccggc ggacagaaga agcctatgca gatacccatg 14460 
tctatgatcc cattggtcga gaaatcaagg tcactaccgc aaaaggctgg ttccgtcgaa 14520 
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ccttgttcac tccttggttt: actgtcaatg aagatgaaaa tgacacagct actgaggtga 14580 

aggtaaagaa gaaagaatgt aaagaaggta aagaaggtaa agatgtaatt tgatcaatcc 14640 

cgcccggttg aagggcggga aacataacat aatatagagg tgaaacgtgt cattcataat 14700 

gccgtcagat actcaactta tgagttggtt gatcattggt tttattgcgg cc tggggcgg 14760 

attagtaagg tacctcattg atatacaaaa caaacaatgt aaatggaatt ggatcaacgt 14820 

actctgtcaa ctcattatct cctgttttac cggtatattg ggaggactgc tgagttttga 14880 

aagcggcggc agcccctata tgacttttgc gattgccggg ctatttggca ccacgggaag 14940 

ttctggattg aactggatct ggcgtcgcct ttttatgcat tatcgcgatg atggaggaaa 15000 

gcaataaggc attcccactg ccgcaaaaac catctgtctc cggcagttaa accgggaaat 15060 

cacctactac aactattgta agaaaacgaa tatatagaaa aactaacatg cagataaaaa 15120 

ctgcgattgc agaacagatg acacacaacg ccccaacaac gaggtaaatc atg aaa 15176 

Met Lys 
1 

aac ate gat cct aaa ctt tat caa aag acc cct gtc gtc aac ate tae 15224 
Asn lie Asp Pro Lys Leu Tyr Gin Lys 'Thr Pro Val Val Asn lie Tyr 
5 10 15 

gat aac cga ggt eta acg ate cgt aae ate gac ttt cac cgt aec acc 15272 
Asp Asn Arg Gly Leu Ita* lie Arg Asn lie Asp Phe His Arg Thr Ite 
20 25 30 

gca aac gge gat ace gat ate egt att act ege cat caa tat gac tec 15320 
Ala Asn Gly Asp Thr Asp lie Arg lie Thr Arg His Gin Tyr Asp Ser 
35 40 45 50 

ctt ggg cac eta age caa age acc gat ecg egt eta tat gaa gee aaa 15368 
Leu Gly His Leu Ser Gin Ser Thr Asp Pro Arg Leu Tyr Glu Ala Lys 
55 60 65 

caa aaa tet aac ttt etc tgg cag tat gat ttg acc ggt aat att ttg 15416 
Gin Lys Ser Asn Phe Leu Trp Gin Tyr Asp Leu Thr Gly Asn lie Leu 
70 75 80 

tgt aca gaa age gtc gat get ggt cgc act gtc acc ttg aat gat att 15464 
Cys Thr Glu Ser Val Asp Ala Gly Arg Thr Val Thr Leu Asn Asp lie 
85 90 95 

gaa gge egt ecg eta etg aca gta act gca aca ggt gtc ata caa ace 15512 
Glu Gly Arg Pro Leu Leu Thr Val Thr Ala Ttir Gly Val lie Gin Thr 
100 105 110 

cga caa tat gaa acg tet tee eta cec ggt egt etg ttg tet gtt ace 15560 
Arg Gin Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Thr 
115 120 125 130 

gaa caa ata cea gaa aaa aca tec cgt ate acc gaa cgc etg att tgg 15608 
Glu Gin lie Pro Glu Lys Thr Ser Arg He Thr Glu Arg Leu He Trp 
135 140 145 

get gge aat age gaa gca gag aaa aac cat aat ctt gcc age cag tgc 15656 
Ala Gly Asn Ser Glu Ala Glu Lys Asn His Asn Leu Ala Ser Gin Cys 
150 155 160 

gtg cgc cac tat gac acg gcg gga gtc acc cga tta gag agt ttg tea 15704 
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Val Ary His Tyr Asp Ttir Ala Gly Val Thr Arg Leu Glu Ser Leu Ser 
165 170 175 

ctg acc ggt act gtt tta tct caa tec age caa eta ttg age gac act 15752 
Leu Thr Gly Thr Val Leu Ser Gin Ser Ser Gin Leu Leu Ser Asp Thr 
180 185 190 

caa gaa get age tgg aca ggt gat aat gaa ace gtc tgg caa aac atg 15800 
Gin Glu Ala Ser Trp Thr Gly Asp Asn Glu Thr Val Trp Gin Asn Met 
195 200 205 210 

ctg get gat gac ate tac aca acc ctg age gee ttt gat gee acc ggc 15848 
Leu Ala Asp Asp lie Tyr Thr Thr Leu Ser Ala PhB Asp Ala Thr Gly 
215 220 225 

get tta etc act cag ace gat gcg aaa ggg aac att cag agg eta acc 15896 
Ala Leu Leu Thr Gin Thr Asp Ala Lys Gly Asn lie Gin Arg Leu Thr 
230 235 240 

tat gat gtg gee ggg cag eta aac ggg age tgg tta ace tta aaa gac 15944 
Tyr Asp Val Ala Gly Gin Leu Asn Gly Ser Trp Leu Thr Leu Lys Asp 
245 250 255 

caa eeg gaa caa gtg att ate aga tec ctg acc tat tee gee gee gga 15992 
Gin Pro Glu Gin Val lie lie Arg Ser Leu Thr Tyr Ser Ala Ala Gly 
260 265 270 

caa aaa tta cge gag gaa cac ggc aat ggt gtt ate acc gaa tac agt 16040 
Gin Lys Leu Arg Glu Glu His Gly Asn Gly Val lie Thr Glu Tyr Ser 
275 280 285 290 

tat gaa ccg gaa ace caa cag ett ate ggt ace aaa acc cac cgt ccg 16088 
Tyr Glu Pro Glu Thr Gin Gin Leu lie Gly Thr Lys Thr His Arg Pro 
295 300 305 

tea gat gee aaa gtg ttg caa gat eta cgt tat gag tat gac ccg gta 16136 
Ser Asp Ala Lys Val Leu Gin Asp Leu Arg Tyr Glu Tyr Asp Pro Val 
310 315 320 

ggc aat gtc ate agt ate cgt aat gac gea gaa gee acc cge tte tgg 16184 
Gly Asn Val lie Ser lie Arg Asn Asp Ala Glu Ala Thr Arg Kie Trp 
325 330 335 

cac aat cag aaa gtg gcg ccg gaa aac act tat acc tac gac tee ttg 16232 
His Asn Gin Lys Val Ala Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu 
340 345 350 

tat cag ctt ate age gea acc ggg cge gag atg gcg aat ata ggt cag 16280 
Tyr Gin Leu lie Ser Ala Thr Gly Arg Glu Met Ala Asn lie Gly Gin 
355 360 365 370 

caa agt aac caa ett cec tec etc acc eta cet tct gat aac aac acc 16328 
Gin Ser Asn Gin Leu Pro Ser Leu Thr Leu Pro Ser Asp Asn Asn Thr 
375 380 385 

tac acc aac tat acc cgt act tat act tat gac cgt ggc ggc aat ttg 16376 
Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly Asn Leu 
390 395 400 

act aaa ate cag cac agt tea ccg gcg acg caa aac aac tac acc aca 16424 
Thr Lys lie Gin His Ser Ser Fro Ala Thr Gin Asn Asn Tyr Thr Thr 
405 410 415 

aac ate acg gtt tct aac egg age aat cge gea gta etc age act ctg 16472 
Asn lie Thr Val Ser Asn Arg Ser Asn Arg Ala Val Leu Ser Thr Leu 
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420 425 430 

acc gaa gat ccg gcg caa gta gat get tta ttt gat gca ggc gga cat 16520 
Thr Glu Asp Pro Ala Gin Val Asp Ala Leu Phe Asp Ala Gly Gly His 
435 440 445 450 

cag aac acg ttg ata tea gga caa aac ctg aac tgg aat aca cgc ggt 16568 
Gin Asn Thr Leu lie Ser Gly Gin Asn Leu Asn Trp Asn Hhr Arg Gly 
455 460 465 

gaa eta caa cat gtg aca ttg gtg aaa egg gae aag gge gcc aat gat 16616 
Glu Leu Gin His Val Itir Leu Val Lys Arg Asp Lys Gly Ala Asn Asp 
470 475 480 

gat egg gaa tgg tat cgc tat agt agt gae ggg aga agg ata tta aaa 16664 
Asp Arg Glu Trp Tyr Arg Tyr Ser Ser Asp Gly Arg Arg lie Leu Lys 
485 490 495 

ate aat gaa cag cag acc age age aac tct caa aca cag aga ata act 16712 
lie Asn Glu Gin Gin llir Ser Ser Asn Ser Gin Thr Gin Arg lie Hir 
500 505 510 

tat ttg ccg age tta gaa ctt cgt eta aca caa aac age acg ate aca 16760 
lyr Leu Pro Ser Leu Glu Leu Arg Leu Hhr Gin Asn Ser Ihr lie Ihr 
515 520 525 530 

acc gaa gat ttg caa gtt ate aca gta gga gaa gcg ggt egg gca cag 16808 
Thr Glu Asp Leu Gin Val lie Thr Val Gly Glu Ala Gly Arg Ala Gin 
535 540 545 

gta cga gta tta cat tgg gat age ggt caa ccg gaa gat ate gae aat 16856 
Val Arg Val Leu His Trp Asp Ser Gly Gin Pro Glu Asp lie Asp Asn 
550 555 560 

aat cag eta cgt tat age tac gat aat ctt ate ggt tec agt caa ett 16904 
Asn Gin Leu Arg Tyr Ser TVr Asp Asn Leu lie Gly Ser Ser Gin Leu 
565 570 575 

gaa tta gae age aaa gga gaa att att agt gag gaa gag tac tat cec 16952 
Glu Leu Asp Ser Lys Gly Glu lie lie Ser Glu Glu Glu Tyr Tyr Pro 
580 585 590 

tat ggc ggc acg gca tta tgg gca aca agg aag egg aca gaa gee agt 17000 
Tyr Gly Gly Thr Ala Leu Trp Ala Thr Arg Lys Arg Thr Glu Ala Ser 
595 600 605 610 

tat aaa ace ate cgt tat tea ggt aaa gag egg gat gee acc gga eta 17048 
lyr Lys *Itrr lie Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu 

615 620 625 

tat tat tac ggt tac cga tat tat cag cet tgg gta gga cga tgg tta 17096 
Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gin Pro Trp Val Gly Arg Trp Leu 
630 635 640 

agt gee gat ccg gca gga aca gta gat ggg ttg aat tta tat egg atg 17144 
Ser Ala Asp Pro Ala Gly Thr Val Asp Gly Leu Asn Leu Tyr Arg Met 
645 650 655 

gta agg aat aat ccg gtt act ctg ctt gat cct gat gga tta atg cea 17192 
Val Arg Asn Asn Pro Val Thr Leu Leu Asp E>ro Asp Gly Leu Met Pro 
660 665 670 

aca att gca gaa cgc ata gca gca ctg caa aaa aat aaa gta gca gat 17240 
•nir He Ala Glu Arg He Ala Ala Leu Gin Lys Asn Lys Val Ala Asp 
675 680 685 690 
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tea gcg cct teg cca aca aat gcc aca aac gta gcg ata aac ate egc 17288 
Ser Ala Pro Ser Pro Thr Asn Ala Thr Asn Val Ala lie Asn lie Arg 
695 700 705 

ccg ccc gta gca cca aaa cct acc tta ccc aaa gca tea acg agt age 17336 
Pro Pro Val Ala Pro Lys Pro Thr Leu Pro Lys Ala Ser Thr Ser Ser 
710 715 720 

caa tea act aca tac ccc ate aaa tct gca age ata aaa cca acg acg 17384 
Gin Ser Thr Thr Tyr Pro lie Lys Ser Ala Ser lie Lys Pro Thr Thr 
725 730 735 

teg gga tea tee att act get cca ctg agt eca gta gga aat aaa tct 17432 
Ser Gly Ser Ser lie Thr Ala Pro Leu Ser Pro Val Gly Asn Lys Ser 
740 745 750 

act cct gaa ata tct ctt cca gaa age act caa age aat tct tea age 17480 
Thr Pro Glu lie Ser Leu Pro Glu Ser Thr Gin Ser Asn Ser Ser Ser 
755 760 765 770 

get att tea aca aat eta eag aaa aag tea ttt act tta tat aga gcg 17528 
Ala lie Ser Thr Asn Leu Gin Lys Lys Ser Phe Thr Leu Tyr Arg Ala 
775 780 785 

gat aat aga tec ttt gaa gac atg eag agt aaa ttc cct gaa gga ttt 17576 
Asp Asn Arg Ser Phe Glu Asp Met Gin Ser Lys ttie Pro Glu Gly Phe 
790 795 800 

aaa gee tgg act cct eta gat act aag atg gca agg eag ttt get agt 17624 
Lys Ala Trp Thr Pro Lai Asp Ihr Lys Met Ala Arg Gin Phe Ala Ser 
805 810 815 

gtc ttt att ggt eag aaa gat act tct aat tta cct aaa gaa aca gte 17672 
Val Phe lie Gly Gin Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val 
820 825 830 

aag aat ata aac aca tgg gga aca aaa cca aaa tta aat gat etc tea 17720 
Lys Asn lie Asn Thr Trp Gly Thr Lys Pro Lys Leu Asn Asp Leu Ser 
835 840 845 850 

act tac ata aaa tat acc aag gac aaa tct aca gta tgg gtc tct act 17768 
Itir Tyr lie Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val Ser Thr 
855 860 865 

gca att aat act gaa gca ggt gga caa agt tea ggg get cca etc cat 17816 
Ala lie Asn Thr Glu Ala Gly Gly Gin Ser Ser Gly Ala Pro Leu His 
870 875 880 

gaa att aat atg gat ctt tat gag ttt acc att gac gga caa aag eta 17864 
Glu lie Asn Met Asp Leu Tyr Glu Phe Thr lie Asp Gly Gin Lys Leu 
885 890 895 

aat cca eta cca agg gga aga tct aaa gac agg gtg cct tea eta tta 17912 
Asn Pro Leu Pro Arg Gly Arg Ser Lys Asp Arg Val Pro Ser Leu Leu 
900 905 910 

ctt gac aca cca gaa ata gaa aca gca tec ata att gca ctt aat cat 17960 
Leu Asp Tta* Pro Glu lie Glu Thr Ala Ser lie lie Ala Leu Asn His 
915 920 925 930 

gga ccg gta aat gat gca gaa gtt tea ttc eta aca aca att ccg ctt 18008 
Gly Pro Val Asn Asp Ala Glu Val Ser Phe Leu Ihr Ttir lie Pro Leu 
935 940 945 



BNSDOCID: <WO 9942589A2_L> 



wo 99/42589 



-34- 



PCT/EP99/01015 



aaa aat gta aaa cct tat aag aga taa cgaaaaatta atattcttta 18055 
Lys Asn Val Lys Pro Tyr Lys Arg 

950 955 

cccaccttta atagccctct tgaacttaca ctcaaggggg ggaaaccaaa taagaaacca 18115 

ncLCtaacaa caagccatga aagaatattt atttcatggc ttgattactt ttaacattca 18175 

ataccaaata attaaaacaa tatctaacca attaaaataa caatacctta tttatcatat 18235 

Laaaacatca aatcagaaat taatgaattt aagggttctt tatatttatt tctgagagca 18295 

caggcacaat accttaccga tggcgctgga cgtgattcaa aatccagaaa tgctatattt 18355 

ccaccaatat gggcagaata gcgcatttca ttgggagtca ttaaacttat cgcgacaccc 18415 

gcttttacca gatccaatct attagtaaaa tcagggaccg tcaataacgc taaattttgg 18475 

tattcaggga gataattcaa tggcataaaa ttattgcatt gttttaaaaa agcactatta 18535 

t-gccgaacaa aaggaaaact agatattatt tcatcagcgt gactttctgg ttctaaaata 18595 

tcatgggata cagcaagaga cagcatttga taagcaccat ctatcctgat gatatcatca 18655 

ttatctggat aacattcagt cgtcacataa actgttatat cccxtttcat taaggaagaa 18715 

aataccgcat cttgccttat taaatcatca attagaaaat tgttgattat acaaatatcg 18775 

cgacaatgat aaogttgcac cgctcttttt acgaccgtag atattttatt aacatattct 18835 

ccacttgtgc caataaccag tttgtctctg tttgataatt tataatttct acgacaattc 18895 

caattattct caactttcag gatcctttca taacacggca gcaactcttg atatagtgcc 18955 

tttccctctt ccgtgagctt ggtttttccc ggtagtcgct caaacaattg acaccccaca 19015 

cgctgtncca gttgatatac gagcctgcta agtggagaag gggtaataca aagcgtatcc 19075 

gccgctaacg tgaatgactc tttcttagct gattccataa aatactttag ttgctttgaa 19135 

caaaatatca tcacataccc tcttgttttc attccagaaa tagaatatta accatagaac 19195 

atgacaacga tgtttctact ttgcattctt ttacattagg acatgcgtta acggacattg 19255 

aatttcacta catcaattgt taatatttat ttaatacttg cacaataatt ataaaataaa 19315 

tataacttag ttaattattt cttgatattg atcatggtaa gttttcctca atacctacag 19375 

aagtagatat tattttatct tccagtaatc tatcgtttgg cgacggaggt cgattcttcc 19435 

attgggatat tcaacccatt cgccgccttt cttattaatt acagtgattt ttggcatttt 19495 

ggtttcatcc aacttaggtt tataggtgat tttccattta gcacccggtg ttaacttcaa 19555 

cctaaaggga tacataccaa cttcaccttg taagaatatt ctgtttggtc taccttcaac 19615 

gactttcaaa atggggtaaa taaccgggct aaaatcaatc gtatccaatg catcaatttc 19675 

gctgatattt gtccgggctg catcattgat aaatgcgatt aaatoggttg ctgaatacgg 19735 

aatagcatct ttcactagat gacggacatc ggtataactc actgacacaa aggctcggtc 19795 

aatcttccac ttacatcgac cgccaccatt aaaaggtagt tttgcctgaa agtaaccggt 19855 

cttcggatca gcttttacat ccagacgtaa tccgttataa gtcggtacct taaaaggcga 19915 
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catattggaa tctaaaogat atttaaggca atcttttgag atatacacag cggatacatg 19975 
cggctgtgtg tatttaggtg cgactccttc tacagtaatc cactgattct ctttgggagg 20035 
agagagcggc tcatttgggt cagcacagcc tgatattaaa atcacggata agacagataa 20095 
gtatttcttg atatttatca tggtaagttt tcctcaactc ctacagcgtt atctgcatgt 20155 
gtgtccaatt ccagatcttc ctgtttatct atttagaaat aaataagcta cgctgatagc 20215 
attacttcat atttccatac atgaatcgaa aatcgacttc ttgagtgccg ttatcaattt 20275 
tgccgcx:cgg atattcaacc cactcgccgc ctttcttatt agtxaccgtg accttcgcca 20335 
ttttggtttc atccagctta ggcttaaaaa taattttcca tttagctcct ggagttaacg 20395 
tgagttgaaa aggacgcatt tttaatactt caccttgtaa gaatattctg ttcgggcgac 20455 
cttcaacgac tttcaaaaca gggtaaataa ccgggctaaa atcaatcgta ttcaatgtcg 20515 
agattttgct aatattcatc tggactatgc cattgataga tgcgattaaa ccggttgctg 20575 
aatacggaat agcatctttc accagatggc tgacatcagt ataactcacc gatacaaagg 20635 
cccggttaat tttccattta catcgtcccc ctccactaaa aggtagtttt gcttgaaaat 20695 
aaccggtttg tggatcggcc ttcactttca gacgaagccc attataggtc ggcactttaa 20755 
aaggogacat attggaatcc agacgatact caaggcaatc ctttgatatg tattctgcgg 20815 
atacatgtgg ttcggtatat ttcggogcta ccccttctac cgtgatccat tgattttctt 20875 
taggagggga aagcggctca tttgggtcag cacagcctga tattaaaatc actgacaaga 20935 
caaataagta ttttttaaca tttatcatgg taagttttcc tcaattccta cagcattatc 20995 
cgcataaata tcctgtcaag aatagcgttc attgatttcg tcaccaaaga aacaagatag 21055 
taaaaatcct attaccacag ataaaaaaca ccgcttatgc cgtgagtaat agtgagttga 21115 
gcgacaggga tacagcagtg catccccatc aattagtccc tttgaataaa gggaacagaa 21175 
tttgaaattt ccgtcatacc gtccatatta cggaacttag attatgatta ttaaatcacc 21235 
accaaatggc aagaaaaatt ttcatttttt aatttacgaa gaatgaattt gtaagaaagt 21295 
gttacaaact taatagaaat taatttactg ttaatctaat gaaggatgaa attataaaaa 21355 
taacccattt ctcagggaca acaatccaca atatatagaa ccactggtcc tcacttaatt 21415 
tcctgtcagg agtagaaata tcctgatgac tcagtcgatg acatacagca atgtcattgg 21475 
tattgagact accgactgtt taataaatct cttttgtctt taatggcgag atacaagtga 21535 
ttcactattt aagcactatc gataaataag attccaaaat agcgccatat cttacaccac 21595 
tcataattct atgtataaca attggttaaa taggatcatg tgtaacagga ttatgaaacg 21655 
ttatttatac caaatctatc aattatttta tatatagttt cacagtcaca ctcgctatct 21715 
ggtaccttca taaccaactg ccctccctgc gctaccttct gataacaaca gctacactaa 21775 
ctatacccgc gcctataatt atgaccgtgt gaaaattcag cgtagttcac cggccacgca 21835 
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aaataactac acgaaatatt gctccccaga gaaacaccgt tcgaggttgt ttcaatgaaa 21895 
caLcaaggta gagacaccta tgtattatta caagatatta aaccctctgc gattactcat 21955 
aggaatgtac gtaatactta tacaggcaac ttcacgtcat ccagagaaaa ttaagttgta 22015 
caaaatagac atcaactaat atagtaatag aaaatcccct gaaaatagat tcaggggatt 22075 
taataaatta accaaaaatc ataataaaaa tttatttcat tattttagga taaatiattta 22135 
actagcccaa taatgaatta ttacttaaag taattcctaa acaatcaaat cggaaattaa 22195 
caaattcaat ggttcttgat atttatgcct gagagtataa gcacaatatt tcactgaggg 22255 
cgtcggatgc gatttaaaat tcaaaaaggt aatgccttta tgaaggtcag cagaacagag 22315 
cacttcattc ggtgtcatta aacttatcgc gatacctgat ttcactaagt ccaaccgatt 22375 
agt^aaatca ggcacagtta gcaaagttaa attctggtat tcaggtagac aattcagtga 22435 
aataaaattg ccgcactgtt taaagaaagc actattatgc tgagccaaag gcaatgtata 22495 
tataatatta tcagcgttac tttccgatcc taaaatatca ttagctacag caaggcgcaa 22555 
agtctgataa gttccatcca ctctgataat atcatcgttg tcaggataat gttcagtcgt 22615 
cacatagacg gttatatctt ccttcattaa agaagaaaat attgtctctt ttcttgccga 22675 
atcatcaacc agaaaattgt tttttatgta aatatcacga taatgataac gttgtaccgc 22735 
tctttttatc accgccgaaa ttttattaat atattctcca cttgtcccga tgaccagttt 22795 
gccggtgttt gctaattttc cgcttctacg ataatgccaa ttatcctcaa ctcgctgaat 22855 
tctctcataa cacggcaata gctcctgata taatgccttc ccctcttcag tgagtttggt 22915 
ccctcccggt agtcgttcaa atagttgaca gcccacacgt tgttccagtt gatatatgat 22975 
cctactcaga ggagagggag taacacaaag cgtatccgcg gctaaagtaa atgactcttt 23035 
tttcgctgac tccataaaat atttcaagct ctttgaacaa aatagcatca tacatccctc 23095 
ctattttiaat teat tgt tec atccgaaata gaatggaatg ttaaeaagaa aacattacaa 23155 
ctacttttet tctttgcatt atttaacatc aaagtatgea ttaactgaga ttgagtttta 23215 
teatctttat tcttaacagt tateaaacaa ttttcattat tattgcaaaa taaatacaac 23275 
cccttcttat gttaeaataa tgattataaa gaaatttcac atattatcat taagtaataa 23335 
tgggcacaat taaccattta attaaacatt tcaactggtt gacaaagact cattatgttc 23395 
aacatgtaat gagcgcaatt ttaacattaa ataaattaca tagttcatat tcattatcac 23455 
tgagatcagc ttttttcgta tagtacatca tgtgaacaat accgtgccat ttcctgccaa 23515 
atcttattaa aaagtcagtt gcaaattttg catctgcttt ttttgcaaca gctatttaaa 23575 
gaaaacagtg agatagtgat tatccgagag atcaagatat gtctgctctt tacgcacaaa 23635 
ctgcaaacca tttctatgca tatctcagct atttctcaaa acctgtattt aatcatctct 23695 
cattccgatg gaacggaatc attctctgat tgatccatga tgtaaagaca atatggatgt 23755 
tccatctact tt atg att tta aaa gga ata aat atg aat teg ccc gta aaa 23806 
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Met lie Leu Lys Gly lie Asn Met Asn Ser Pro Val Lys 
960 965 

gag ata cct gat gta tta aaa ate cag tgt ggt ttt cag tgt ctg aca 23854 
Glu He Pro Asp Val Leu Lys He Gin Cys Gly Phe Gin Cys Leu Thr 
970 975 980 

gat att age cae age tct ttt aac gaa ttt eac cag caa gta tec gaa 23902 
Asp He Ser His Ser Ser Phe Asn Glu Phe His Gin Gin Val Ser Glu 
985 990 995 1000 

cac etc tee tgg tec gaa gea eae gae tta tat cat gat gca caa cag 23950 
His Leu Ser Trp Ser Glu Ala His Asp Leu Tyr His Asp Ala Gin Gin 
1005 1010 1015 

gcc caa aag gat aat egg ctg tat gaa gcg cgt att ctt aaa cgc acg 23998 
Ala Gin Lys Asp Asn Arg Leu Tyr Glu Ala Arg He Leu Lys Arg Ihr 
1020 1025 1030 

aat cct caa tta caa aat get gta cat ctt gcc ate gta gcg cct aat 24046 
Asn Pro Gin Leu Gin Asn Ala Val His Leu Ala He Val Ala Pro Asn 
1035 1040 1045 

get gaa ctg ata gge tat aac aac caa ttt age gge agg gcc agt caa 24094 
Ala Glu Leu He Gly Tyr Asn Asn Gin Phe Ser Gly Arg Ala Ser Gin 
1050 1055 1060 

tat gte gcg ccg ggt aec gtt tec tec atg ttc tec ecc gee get tat 24142 
Tyr Val Ala Pro Gly Thr Val Ser Ser Met Phe Ser Pro Ala Ala Tyr 
1065 1070 1075 1080 

ttg act gag ctt tat cgt gaa gca cgc aat tta cac gcc age gat tec 24190 
Leu Thr Glu Leu Tyr Arg Glu Ala Arg Asn Leu His Ala Ser Asp Ser 
1085 1090 1095 

gtt tat cgc ctg gat act cgc cgc cca gat etc aaa tea atg gcg etc 24238 
Val Tyr Arg Leu Asp Thr Arg Arg Pro Asp Leu Lys Ser Met Ala Leu 
1100 1105 1110 

agt caa caa aat atg gat acg gaa ctt tec act etc tct tta tec aat 24286 
Ser Gin Gin Asn Met Asp Thr Glu Leu Ser Thr Leu Ser Leu Ser Asn 
1115 1120 1125 

gag eta tta ttg gaa age att aaa act gag tct aag ctg gat aat tat 24334 
Glu Leu Leu Leu Glu Ser He Lys Thr Glu Ser Lys Leu Asp Asn Tyr 
1130 1135 1140 

act caa gtg atg gaa atg etc tec get ttc cgt cct tec gge gcg acg 24382 
Thr Gin Val Met Glu Met Leu Ser Ala Phe Arg Pro Ser Gly Ala Thr 
1145 1150 1155 1160 

eet tat eae gat get tac gaa aat gtg cgt aaa gtt ate cag eta caa 24430 
Pro Tyr His Asp Ala Tyr Glu Asn Val Arg Lys Val He Gin Leu Gin 
1165 1170 1175 

gat cct ggg ctt gag caa tta aat get tea cca gee att gcc ggg ctg 24478 
Asp Pro Gly Leu Glu Gin Leu Asn Ala Ser Pro Ala He Ala Gly Leu 
1180 1185 1190 

atg cat caa get tec eta tta ggt att aac get tea ate tea cct gag 24526 
Met His Gin Ala Ser Leu Leu Gly He Asn Ala Ser He Ser Pro Glu 
1195 1200 1205 

ttg ttt aat att ctg acg gag gag att act gaa ggt aat get gag gaa 24574 
Leu Phe Asn He Leu Thr Glu Glu He Thr Glu Gly Asn Ala Glu Glu 
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1210 



1215 



1220 



ctt tat aag aaa aat ttt ggt aat ate gaa ccg get tea ctg get atg 
Leu Tyr Lys Lys Asn Phe Gly Asn lie Glu Pro Ala Ser Leu Ala Met 
1225 1230 1235 1240 



24622 



ccg gaa tac ctt aga cgt tat tac aat tta agt gat gaa gaa etc age 
E>ro Glu Tyr Leu Arg Arg Tyr Tyr Asn Leu Ser Asp Glu Glu Leu Ser 
1245 1250 1255 



24670 



cag ttt att ggt aaa gee age aat tte ggc caa caa gaa tat agt aat 
Gin Phe He Gly Lys Ala Ser Asn Phe Gly Gin Gin Glu Tyr Ser Asn 
1260 1265 1270 



24718 



aac caa etc att act ccg ata gte aac age aat gat ggc aca gtc aag 
Asn Gin Leu He Thr Pro He Val Asn Ser Asn Asp Gly Thr Val Lys 
1275 1280 1285 



24766 



gta tat ega att ace cge gaa tat aca aca aat gee aat caa gta gae 
Val Tyr Arg He Thr Arg Glu Tyr Thr Thr Asn Ala Asn Gin Val Asp 
1290 1295 1300 



24814 



gtg gag ctg ttt cce tac ggt gga gaa aat tat cag tta aat tac aaa 
Val Glu Leu Phe Pro Tyr Gly Gly Glu Asn Tyr Gin Leu Asn Tyr Lys 
1305 1310 1315 1320 



24862 



tte aaa gat tct cgt cag gat gtc tec tat tta tec ate aaa tta aat 
Phe Lys Asp Ser Arg Gin Asp Val Ser Tyr Leu Ser He Lys Leu Asn 
1325 1330 1335 



24910 



gae aaa aga gaa ctt ate cga att gaa gga gcg cct cag gtc aac ate 
Asp Lys Arg Glu Leu He Arg He Glu Gly Ala Pro Gin Val Asn He 
1340 1345 1350 



24958 



gaa tat tea gaa cat ate aca tta agt aca act gat ate agt caa cct 
Glu Tyr Ser Glu His He Thr Leu Ser Thr Thr Asp He Ser Gin Pro 
1355 1360 1365 



25006 



ttt gaa ate ggc eta aca cga gta tat cct tct agt tct tgg gca tat 
Phe Glu He Gly Leu Thr Arg Val Tyr Pro Ser Ser Ser Trp Ala Tyr 
1370 1375 1380 



25054 



gca gee gca aaa ttt ace att gag gaa tat aac caa tac tct tte ctg 
Ala Ala Ala Lys Phe Thr He Glu Glu Tyr Asn Gin Tyr Ser Pi-ie Leu 
1385 1390 1395 1400 



25102 



tta aaa etc aat aaa get att cgt eta tct cgt gcg aca gaa tta tea 
Leu Lys Leu Asn Lys Ala He Arg Leu Ser Arg Ala Thr Glu Leu Ser 
1405 1410 1415 



25150 



cce ace att ctg gaa agt att gtg cgt agt gtt aat cag caa ctg gat 
Pro Thr He Leu Glu Ser He Val Arg Ser Val Asn Gin Gin Leu Asp 
1420 1425 1430 



25198 



ate aac gca gaa gta tta ggt aaa gtt ttt ctg act aaa tat tat atg 
He Asn Ala Glu Val Leu Gly Lys Val Phe Leu Thr Lys Tyr Tyr Met 
1435 1440 1445 



25246 



caa cgt tat get att aat get gaa act gee eta ata eta tgc aat gca 
Gin Arg Tyr Ala He Asn Ala Glu Thr Ala Leu He Leu Cys Asn Ala 
1450 1455 1460 



25294 



ctt att tea caa cgt tea tat gat aat caa cct age caa ttt gat cgc 
Leu He Ser Gin Arg Ser lyr Asp Asn Gin Pro Ser Gin Phe Asp Arg 
1465 1470 1475 1480 



25342 
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ctg ttt aat acg cca tta ctg aac ggc caa tat ttt tct acc gga gat 25390 
Leu Phe Asn Thr Pro Leu Leu Asn Gly Gin Tyr PhB Ser Thr Gly Asp 
1485 1490 1495 

gaa gag att gat tta aat cca ggt agt act ggc gat tgg cgt aaa tec 25438 
Glu Glu He Asp L^u Asn Pro Gly Ser Thr Gly Asp Trp Arg Lys Ser 
1500 1505 1510 

gtg ctt aaa cgt gca ttt aat ate gat gat att tec etc tac cgc ctg 25486 
Val Leu Lys Arg Ala Phe Asn He Asp Asp He Ser Leu lyr Arg Leu 
1515 1520 1525 

ctt aaa att acc aac cat aat aat caa gat gga aag att aaa aat aac 25534 
Leu Lys He 'Thr Asn His Asn Asn Gin Asp Gly Lys He Lys Asn Asn 
1530 1535 1540 

tta aat aat ctt tct gat tta tat att ggg aaa tta ctg gca gaa att 25582 
Leu Asn Asn Leu Ser Asp Leu Tyr He Gly Lys Leu Leu Ala Glu He 
1545 1550 1555 1560 

cat caa tta acc att gat gaa ttg gat tta ttg ctg gtt gcc gtg ggt 25630 
His Gin Leu Itir He Asp Glu Leu Asp Leu Leu Leu Val Ala Val Gly 
1565 1570 1575 

gaa gga gaa act aat tta tec get ate agt gat aaa caa ctg gcg gca 25678 
Glu Gly Glu Thr Asn Leu Ser Ala He Ser Asp Lys Gin Leu Ala Ala 
1580 1585 1590 

ctg ate aga aaa etc aat ace att acc gtc tgg eta cag aca cag aag 25726 
I^u He Arg Lys Leu Asn Thr He Ihr Val Tirp Leu Gin Thr Gin Lys 
1595 1600 1605 

tgg agt gcg ttc caa tta ttt gtt atg act tec acc age tat aac aaa 25774 
Trp Ser Ala Phe Gin Leu Phe Val Met Thr Ser Thr Ser Tyr Asn Lys 
1610 1615 1620 

acg ctg acg cet gaa att aag aat ctg ctg gat acc gtc tac cac ggt 25822 
Thr Leu Thr Pro Glu He Lys Asn Leu Leu Asp Thr Val TVr His Gly 
1625 1630 1635 1640 

tta caa ggc ttt gat aaa gac aag gca aat tta ctg cat gtt atg gcg 25870 
Leu Gin Gly Phe Asp Lys Asp Lys Ala Asn Leu Leu His Val Met Ala 
1645 1650 1655 

ecc tat att gcg gcc acc tta caa tta tea teg gaa aat gtc gcc cat 25918 
Pro Tyr He Ala Ala Thr Leu Gin Leu Ser Ser Glu Asn Val Ala His 
1660 1665 1670 

tct gtg ctg ctt tgg gca gac aag tta aag ecc ggc gac ggc gca atg 25966 
Ser Val Leu Leu Trp Ala Asp Lys Leu Lys Pro Gly Asp Gly Ala Met 
1675 1680 1685 

aca gcc gaa aaa ttc tgg gac tgg ttg aat act caa tat acg cca gat 26014 
Thr Ala Glu Lys Phe Trp Asp Trp Leu Asn Thr Gin Tyr Thr Pro Asp 
1690 1695 1700 

tea teg gaa gta tta gca aca cag gaa cat att gtt cag tat tgt cag 26062 
Ser Ser Glu Val Leu Ala Thr Gin Glu His He Val Gin Tyr Cys Gin 
1705 1710 1715 1720 

gcg ttg gcg caa tta gaa atg gtt tac cat tec acc ggt ate aat gaa 26110 
Ala Leu Ala Gin Leu Glu Met Val Tyr His Ser Thr Gly He Asn Glu 
1725 1730 1735 
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aac gcc ttc cgc ctg ttt gtg aca aaa cca gag atg ttt ggc teg tea 
Asn Ala I*ie Arg Leu Phe Val Thr Lys Pro Glu Met Phe Gly Ser Ser 
1740 1745 1750 

act gag gca gta cct gcg cat gat gca ctt tea ctg ate atg ctg acg 
Thr Glu Ala Val Pro Ala His Asp Ala Leu Ser Leu lie Met Leu Thr 
1755 1760 1765 



eta gcg gca ttt gaa get aac agt tta acg gca gaa caa ttg get gat 
Leu Ala Ala Phe Glu Ala Asn Ser Leu Thr Ala Glu Gin Leu Ala Asp 
1785 1790 1795 1800 

gcc atg aat ctt gat get aat ttg eta ttg caa gee agt act caa gca 
Ala Met Asn Leu Asp Ala Asn Leu Leu Leu Gin Ala Ser Thr Gin Ala 
1805 1810 1815 



26158 



26206 



egt ttt gca gat tgg gtt aat gcg tta ggc gaa aaa gee tct tec gta 26254 
Arg Phe Ala Asp Trp Val Asn Ala Leu Gly Glu Lys Ala Ser Ser Val 
1770 1775 ' 1780 



26302 



26350 



caa aac cat caa cat ctt ccc cca gtg acg caa aaa aat get ttc tec 26398 
Gin Asn His Gin His Leu Pro Pro Val Thr Gin Lys Asn Ala Phe Ser 
1820 1825 1830 

tgt tgg aca tct ate gae act ate ctg caa tgg gtt aat gtt gca caa 26446 
Cys Trp Thr Ser lie Asp Thr lie Leu Gin Trp Val Asn Val Ala Gin 
1835 1840 1845 

eaa ttg aat gtc gcc cca cag gga gtt tec get ttg gtc ggg ctg gat 26494 
Gin Leu Asn Val Ala Pro Gin Gly Val Ser Ala Leu Val Gly Leu Asp 
1850 1855 I860 

tat att caa tta aat caa aaa ate ccc acc tat gee cag tgg gaa agt 26542 
TVr lie Gin Leu Asn Gin Lys He Pro Thr Tyr Ala Gin Trp Glu Ser 
1865 1870 1875 1880 

get ggg gaa ata ttg act gcc gga ttg aat tea eaa cag get gat ata 26590 
Ala Gly Glu He Leu Thr Ala Gly Leu Asn Ser Gin Gin Ala Asp He 
1885 1890 1895 



tta cac get ttt ttg gae gaa tct cgc agt gee gca tta age acc tac 
Leu His Ala Phe Leu Asp Glu Ser Arg Ser Ala Ala Leu Ser Thr TVr 
1900 1905 1910 



26638 



tat ate cgt r^^ gtc gcc aag cca gcg gca gcc ata aaa age cgt gat 
Tyr He Arg Gin Val Ala Lys Pro Ala Ala Ala He Lys Ser Arg Asp 
1915 1920 1925 



26686 



gae ttg tac caa tac tta eta att gat aat cag gtt tec get gca ate 
Asp Leu Tyr Gin TVr Leu Leu He Asp Asn Gin Val Ser Ala Ala He 
1930 1935 1940 



26734 



aaa act ace egg att gee gaa gcc att gcc age att caa ctg tac gtc 
Lys Thr Thr Arg He Ala Glu Ala He Ala Ser He Gin Leu Tyr Val 
1945 1950 1955 1960 



26782 



aac cgc acg ctg gaa aat gta gaa gaa aat gcc cat tea ggg gtt ate 
Asn Arg Thr Leu Glu Asn Val Glu Glu Asn Ala His Ser Gly Val He 
1965 1970 1975 



26830 



age cgt cag ttc ttt ate gae tgg gae aaa tat aac aaa cgc tac age 26878 
Ser Arg Gin Phe E^e He Asp Trp Asp Lys Tyr Asn Lys Arg TVr Ser 
1980 1985 1990 

ace tgg gcg ggt gtt tct caa tta gtt tac tac ccg gaa aac tat att 26926 
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Thr Trp Ala Gly Val Ser Gin leu Val Tyr Tyr Pro Glu Asn Tyr lie 
1995 2000 2005 

gat ccc acc atg cgt ate gga caa acc aaa atg atg gac gca tta ttg 26974 
Asp Pro Thr Met Arg lie Gly Gin Thr Lys Met Met Asp Ala Leu Leu 
2010 2015 2020 

caa tec gtc age caa age caa tta aat gee gat act gtc gaa gac gee 27022 
Gin Ser Val Ser Gin Ser Gin Leu Asn Ala Asp Thr Val Glu Asp Ala 
2025 2030 2035 2040 

ttt atg tct tat ctg aca teg ttt gag caa gtg get aat ctt aaa gtt 27070 
Phe Met Ser Tyr Leu Thr Ser Phe Glu Gin Val Ala Asn Leu Lys Val 
2045 2050 2055 

att age gcg tat cac gat aat att aac aac gat caa ggg ctg acc tat 27118 
lie Ser Ala Tyr His Asp Asn lie Asn Asn Asp Gin Gly Leu Thr Tyr 
2060 2065 2070 

ttt ate ggc etc agt gaa act gat acc ggt gaa tac tat tgg cgc agt 27166 
Phe lie Gly Leu Ser Glu Ihr Asp Thr Gly Glu Tyr Tyr Trp Arg Ser 
2075 2080 2085 

gtc gat cac agt aaa ttc age gac ggt aaa tte gee get aat gee tgg 27214 
Val Asp His Ser Lys Phe Ser Asp Gly Lys Phe Ala Ala Asn Ala Trp 
2090 2095 2100 

agt gaa tgg cac aaa att gat tgt cca att aat ect tac cga age act 27262 
Ser Glu Trp His Lys lie Asp Cys Pro lie Asn Pro Tyr Arg Ser Thr 
2105 2110 2115 2120 

ate cgt cct gtg atg tac aaa tec cgc ttg tat ctg etc tgg ttg gaa 27310 
lie Arg Pro Val Met Tyr Lys Ser Arg Leu Tyr Leu Leu Trp Leu Glu 
2125 2130 2135 

caa aag gag ate act aaa caa aca gga aat age aaa gat ggc tat caa 27358 
Gin Lys Glu He Ihr Lys Gin Thr Gly Asn Ser Lys Asp Gly Tyr Gin 
2140 2145 2150 

acc gag aca gat tat cgt tat gag eta aaa ttg gcg cat ate cgt tat 27406 
Thr Glu Thr Asp Tyr Arg Tyr Glu Leu Lys Leu Ala His He Arg Tyr 
2155 2160 2165 

gac ggt acc tgg aat aeg cca ate act ttt gat gtc aat gaa aaa ata 27454 
Asp Gly Thr Trp Asn Ihr Pro He Thr Phe Asp Val Asn Glu Lys He 
2170 2175 2180 

tee aag eta gaa ctg gea aaa aat aaa gcg cct ggg etc tat tgt get 27502 
Ser Lys Leu Glu Leu Ala Lys Asn Lys Ala Pro Gly Leu Tyr Cys Ala 
2185 2190 2195 2200 

ggt tat caa ggt gaa gat aeg ttg ctg gtt atg ttt tat aac caa caa 27550 
Gly lyr Gin Gly Glu Asp Thr Leu Leu Val Met Phe Tyr Asn Gin Gin 
2205 2210 2215 

gat aca etc gat agt tat aaa acc get tea atg caa ggg eta tat ate 27598 
Asp Thr Leu Asp Ser Tyr Lys Thr Ala Ser Met Gin Gly Leu Tyr He 
2220 2225 2230 

ttt gee gat atg gaa tat aaa gat atg ace gat gga caa tac aaa tct 27646 
Phe Ala Asp Met Glu Tyr Lys Asp Met Thr Asp Gly Gin Tyr Lys Ser 
2235 2240 2245 

tat egg gac aac age tat aaa caa tte gat act aat agt gtc aga aga 27694 
Tyr Arg Asp Asn Ser Tyr Lys Gin Phe Asp Thr Asn Ser Val Arg Arg 
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2250 2255 2260 

gtg aat aac cgc tat gca gag gat tat gaa att ccc tea teg gta aat 27742 
Val Asn Asn Arg lyr Ala Glu Asp Tyr Glu He Pro Ser Ser Val Asn 
2265 2270 2275 2280 

age cgt aaa ggc tat gat tgg gga gat tat tat cte agt atg gta tat 27790 
Ser Arg Lys Gly Tyr Asp Trp Gly Asp Tyr Tyr Leu Ser Met Val Tyr 
2285 2290 2295 

aac gga gat att cca act att agt tae aaa gee aca tea agt gat tta 27838 
Asn Gly Asp He Pro Thr He Ser Tyr Lys Ala Thr Ser Ser Asp Leu 
2300 2305 2310 

aaa ate tat ate teg eea aaa tta aga att att cat aat gga tat gaa 27886 
Lys He Tyr He Ser Pro Lys Leu Arg He He His Asn Gly Tyr Glu 
2315 2320 2325 

ggg cag eaa cge aat eaa tgc aat eta atg aat aaa tat ggc aaa eta 27934 
Gly Gin Gin Arg Asn Gin Cys Asn Leu Met Asn Lys Tyr Gly Lys Leu 
2330 2335 2340 

ggt gat aaa ttt att gtt tat act age ttg gga gtt aat eea aat aat 27982 
Gly Asp Lys Phe He Val Tyr Thr Ser Leu Gly Val Asn Pro Asn Asn 
2345 2350 2355 2360 

teg tea aat aag etg atg ttt tae ccc gtt tat caa tat aac gga aat 28030 
Ser Ser Asn Lys Leu Met Phe Tyr Pro Val Tyr Gin Tyr Asn Gly Asn 
2365 2370 2375 

gtc agt ggg ctt agt eaa ggg aga tta eta ttc cac cgt gac acc aat 28078 
Val Ser Gly Leu Ser Gin Gly Arg Leu Leu Phe His Arg Asp Thr Asn 
2380 2385 2390 

tat tea tct aaa gta gaa get tgg att cet gga gca gga cgt tct eta 28126 
Tyr Ser Ser Lys Val Glu Ala Trp He E>ro Gly Ala Gly Arg Ser Leu 
2395 2400 2405 

acc aat ccg aat get gcc att ggt gat gat tat get aca gac teg tta 28174 
Thr Asn Pro Asn Ala Ala He Gly Asp Asp Tyr Ala Thr Asp Ser Leu 
2410 2415 2420 

aac aaa ccg aat gat ctt aag eaa tae gtc tat atg act gac agt aaa 28222 
Asn Lys Pro Asn Asp Leu Lys Gin Tyr Val Tyr Met Thr Asp Ser Lys 
2425 2430 2435 2440 

ggt act get acc gat gtc tea gga eea gta gat ate aat act gca att 28270 
Gly Thr Ala Thr Asp Val Ser Gly Pro Val Asp He Asn Ihr Ala He 
2445 2450 2455 

tec ccg gca aaa gtt cag gta aca gta aaa gee ggt age aaa gaa caa 28318 
Ser Pro Ala Lys Val Gin Val Thr Val Lys Ala Gly Ser Lys Glu Gin 
2460 2465 2470 

aeg ttt ace gcg gat aaa aat gtc tec att cag eea tec ect age ttt 28366 
Thr Phe Thr Ala Asp Lys Asn Val Ser He Gin Pro Ser Pro Ser Phe 
2475 2480 248S 

gac gaa atg aat tat caa ttt aat get etc gaa ata gat ggc tea agt 28414 
Asp Glu Met Asn Tyr Gin Phe Asn Ala Leu Glu He Asp Gly Ser Ser 
2490 2495 2500 

ccg aac ttt act aac aat tea gcc agt att gat att acc ttt acc gca 28462 
Leu Asn Phe Thr Asn Asn Ser Ala Ser He Asp He Thr Phe Thr Ala 
2505 2510 2515 2520 
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ttt gca gag gat gga cgt aaa ctg ggt tat gaa agt ttc agt att cct 28510 
Phe Ala Glu Asp Gly Arg Lys Leu Gly Tyr Glu Ser Phe Ser lie Pro 
2525 2530 2535 

att acc cgc aag gtg agt act gat aat tec ctg acc ctg cgc cat aat 28558 
lie Ihr Arg Lys Val Ser Ihr Asp Asn Ser Leu Thr Leu Arg His Asn 
2540 2545 2550 

gaa aat ggt gcg caa tat atg caa tgg gga gtc tat cgc att cgt ctt 28606 
Glu Asn Gly Ala Gin Tyr Met Gin Trp Gly Val Tyr Arg lie Arg Leu 
2555 2560 2565 

aat act tta ttt get cgc caa tta gtt gcg cga gcc act acc ggt att 28654 
Asn Itir Leu Phe Ala Arg Gin Leu Val Ala Arg Ala Thr Thr Gly lie 
2570 2575 2580 

gat acg att ctg agt atg gaa act cag aat att cag gaa cca cag tta 28702 
Asp Thr lie Leu Ser Met Glu Thr Gin Asn lie Gin Glu Pro Gin Leu 
2585 2590 2595 2600 

ggc aaa ggt ttc tac get acg ttc gtg ata cct ccg tat aac cca tea 28750 
Gly Lys Gly Phe Tyr Ala Ihr Phe Val lie Pro E>ro Oyr Asn Pro Ser 
2605 2610 2615 

act cat ggt gat gaa cgt tgg ttt aag ctt tat ate aaa cat gtt gtt 28798 
Thr His Gly Asp Glu Arg Trp E*ie Lys Leu Tyr He Lys His Val Val 
2620 2625 2630 

gat aat aat tea cat att ate tat tea ggt cag eta aaa gat aca aat 28846 
Asp Asn Asn Ser His He He Tyr Ser Gly Gin Leu Lys Asp Ihr Asn 
2635 2640 2645 

ata age acc acg tta ttt ate cct ctt gat gat gtt cca ttg aac caa 28894 
He Ser Thr Thr Leu Phe He Pro Leu Asp Asp Val Pro Leu Asn Gin 
2650 2655 2660 

gat tac age gcc aag gtt tac atg acc ttc aag aaa tea cca tea gat 28942 
Asp Tyr Ser Ala Lys Val Tyr Met Ihr Phe Lys Lys Ser Pro Ser Asp 
2665 2670 2675 2680 

ggt acc tgg tgg ggc cct eac ttt gtt aga gat gat aaa gga ata gta 28990 
Gly Thr Trp Trp Gly Pro His Phe Val Arg Asp Asp Lys Gly He Val 
2685 2690 2695 

aca ata aac cct aaa tee att ttg ace eac ttt gag age gtc aat gtc 29038 
Thr He Asn Pro Lys Ser He Leu Ihr His the Glu Ser Val Asn Val 
2700 2705 2710 

ctg aat aat att agt age gaa cca atg gat ttc age ggc get aac age 29086 
Leu Asn Asn He Ser Ser Glu Pro Met Asp Phe Ser Gly Ala Asn Ser 
2715 2720 2725 

etc tat ttt tgg gaa ctg ttc tac tat ace ccg atg ctg gtt gcc caa 29134 
Leu Tyr Phe Trp Glu Leu Phe Tyr Tyr Thr Pro Met Leu Val Ala Gin 
2730 2735 2740 

cgt ttg ttg cat gag caa aac ttt gat gaa gcg aac cgc tgg ctg aaa 29182 
Arg Leu Leu His Glu Gin Asn Phe Asp Glu Ala Asn Arg Trp Leu Lys 
2745 2750 2755 2760 

tat gtc tgg age cca tec ggg tat att gtt eac ggc cag att cag aat 29230 
Tyr Val Trp Ser Pro Ser Gly lyr He Val His Gly Gin He Gin Asn 
2765 2770 2775 
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cat caa tgg aac gtc cgc ccg tta ttg gaa gat acc agt tgg aac agt 
Tyr Gin Trp Asn Val Arg Pro Leu Leu Glu Asp Ttir Ser Trp Asn Ser 
2780 2785 2790 



29278 



gat cct ttg gat tec gtc gat cct gac gcg gta gcg cag cac gat ccg 29326 
Asp Pro Leu Asp Ser Val Asp Pro Asp Ala Val Ala Gin His Asp Pro 
2795 2800 2805 



atg cac tat aaa gtt tea acc ttt atg cgc acc ctt gat ctg ttg ate 
Met His Tyr Lys Val Ser Ita- Phe Met Arg Thr Leu Asp L^u L^u He 
2810 2815 2820 

gcg cgc ggc gac cat get tac cgc caa ttg gag cgc gat acg ctt aac 
Ala Arg Gly Asp His Ala Tyr Arg Gin Leu Glu Arg Asp Thr Leu Asn 
2825 2830 2835 2840 

gaa gcg aag atg tgg tat atg caa gcg ctg cat ctg tta ggc gat aaa 
Glu Ala Lys Met Trp Tyr Met Gin Ala Leu His Leu Leu Gly Asp Lys 
2845 2850 2855 

cct tat ctg ccg ctg agt acc aca tgg aat gat cca cga ctg gac aaa 
Pro Tyr Leu Pro Leu Ser Thr Ttir Trp Asn 7^ Pro Arg 1^ Asp Lys 
2860 2865 2870 

gee gcg gat att act acc caa agt get cat tec age tea ata gtc get 
Ala Ala Asp He Thr Thr Gin Ser Ala His Ser Ser Ser He Val Ala 
2875 2880 2885 

ttg egg cag agt aca ccg gcg ctt tta tea ttg cgc age gee aat acc 
Leu Arg Gin Ser Thr Pro Ala Leu Leu Ser Leu Arg Ser Ala Asn Thr 
2890 2895 2900 

ctg acc gat etc tte ctg ccg caa ate aat gaa gtg atg atg aat tac 
Leu Thr Asp Leu Phe Leu Pro Gin He Asn Glu Val Met Met Asn Tyr 
2905 2910 2915 2920 

tgg caa aca tta get cag aga gta tac aac ctg cgc cac aac etc tet 
Trp Gin rhr Leu Ala Gin Arg Val Tyr Asn Leu Arg His Asn Leu Ser 
2925 2930 2935 



29374 



29422 



29470 



29518 



29566 



29614 



29662 



29710 



ate gac ggt cag ccg tta tat ctg cca ate tat gee aca ccg gcg gac 
He Asp Gly Gin Pro Leu lyr Leu Pro He Tyr Ala Thr Pro Ala Asp 
2940 2945 2950 



29758 



ccg aaa gcg tta etc age gee get gtt gee act tct caa ggt gga ggc 
Pro Lys Ala Leu Leu Ser Ala Ala Val Ala Thr Ser Gin Gly Gly Gly 
2955 2960 2965 



29806 



aag ctg ccg gag tea ttt atg tec ctg tgg egt tte ccg cac atg ctg 
Lys Leu Pro Glu Ser Phe Met Ser Leu Trp Arg Phe Pro His Met Leu 
2970 2975 2980 

gaa aat get cgc age atg gtt age cag etc acc caa tte ggc tec aeg 
Glu Asn Ala Arg Ser Met Val Ser Gin Leu Thr Gin Phe Gly Ser Thr 
2985 2990 2995 3000 

tta caa aat att ate gaa cgt cag gac gea gaa gcg etc aat gcg tta 
Leu Gin Asn He He Glu Arg Gin Asp Ala Glu Ala Leu Asn Ala Leu 
3005 3010 3015 

tta caa aat cag gee gea gag ctg ata ttg act aac ctg agt att caa 
Leu Gin Asn Gin Ala Ala Glu Leu He Leu Thr Asn Leu Ser He Gin 
3020 3025 3030 

gac aaa acc att gaa gaa ctg gat gee gag aaa acc gtg ctg gaa aaa 



29854 



29902 



29950 



29998 



30046 
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Asp Lys Thr lie Glu Glu Leu Asp Ala Glu Lys Thr Val Leu Glu Lys 
3035 3040 3045 



tec gcg gga gca caa teg egc ttt gat age tat age aaa ctg cat 
Ser Lys Ala Gly Ala Gin Ser Arg Phe Asp Ser Tyr Ser Lys Leu His 
3050 3055 3060 



30094 



gat gaa aac ate aac gee ggt gaa aae caa get atg aeg eta cga gcg 
Asp Glu Asn lie Asn Ala Gly Glu Asn Gin Ala Met Ihr Leu Arg Ala 
3065 3070 3075 3080 



30142 



tec gca gcc ggg ctt acc aeg gcg gtt cag gca tec cgt ctg gee ggc 
Ser Ala Ala Gly Leu Thr Thr Ala Val Gin Ala Ser Arg Leu Ala Gly 
3085 3090 3095 



30190 



gca gcg get gat ctg gtg cet aac ate ttc ggc ttc gcc ggt ggt ggt 
Ala Ala Ala Asp Leu Val Pro Asn lie Phe Gly Phe Ala Gly Gly Gly 
3100 3105 3110 



30238 



age cgt tgg ggg get ate get gag gcg acc ggc tat gta atg gaa ttt 
Ser Arg Trp Gly Ala He Ala Glu Ala Thr Gly Tyr Val Met Glu Phe 
3115 3120 3125 



30286 



tee get aat gtt atg aat aee gaa gcg gat aaa att age caa tet gaa 
Ser Ala Asn Val Met Asn Ihr Glu Ala Asp Lys He Ser Gin Ser Glu 
3130 3135 3140 



30334 



aee tac cgt cgt cgc cgt cag gag tgg gaa att cag cgt aat aat gee 
Thr Tyr Arg Arg Arg Arg Gin Glu Trp Glu He Gin Arg Asn Asn Ala 
3145 3150 3155 3160 



30382 



gaa gcg gag ctg aaa caa etc gat gcc caa ctt aaa teg ctg gea gta 
Glu Ala Glu Leu Lys Gin Leu Asp Ala Gin Leu Lys Ser Leu Ala Val 
3165 3170 3175 



30430 



cgc cgt gaa gcc gcc gta ttg caa aaa acc age ctg aaa acc caa caa 
Arg Arg Glu Ala Ala Val Leu Gin Lys Thr Ser Leu Lys T!hr Gin Gin 
3180 3185 3190 



30478 



gag cag acc caa gee caa ttg gcc ttc ctg caa cgt aag ttc age aat 
Glu Gin Vnr Gin Ala Gin Leu Ala Phe Leu Gin Arg Lys Phe Ser Asn 
3195 3200 3205 



30526 



caa gcg ttg tac aac tgg eta cgt ggc cga ctg gca gca att tac ttc 
Gin Ala Leu Tyr Asn Trp Leu Arg Gly Arg Leu Ala Ala He Tyr Phe 
3210 3215 3220 



30574 



caa ttc tac gac ttg get ate gcg cgt tgt tta atg gea gag cag get 
Gin Phe Tyr Asp Leu Ala He Ala Arg cys Leu Met Ala Glu Gin Ala 
3225 3230 3235 3240 



30622 



tac cgt tgg gaa att age gat gac tct get egc ttt att aaa ccg ggc 
Tyr Arg Trp Glu He Ser Asp Asp Ser Ala Arg Phe He Lys Pro Gly 
3245 3250 3255 



30670 



gcc tgg caa gga ace tat gea ggt ctg ctg gca ggt gaa ace ttg atg 
Ala Trp Gin Gly Thr Tyr Ala Gly Leu Leu Ala Gly Glu Thr Leu Met 
3260 ' ^ 3265- 3270 



30718 



eta agt ttg gca caa atg gaa gac gcc cat tta aga cgc gat aaa cgc 
Leu Ser Leu Ala Gin Met Glu Asp Ala His Leu Arg Arg Asp Lys Arg 
3275 3280 3285 



30766 



gca tta gag gtc gaa cgt aca gta teg ctg gcc gaa att tat get ggt 30814 
Ala Leu Glu Val Glu Arg Thr Val Ser Leu Ala Glu He Tyr Ala Gly 
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3290 



3295 



3300 



cca ccg caa gac aaa ggc cca ttc tec ctg acg caa gaa ate gag aag 
Leu Pro Gin Asp Lys Gly Pro Phe Ser Leu Thr Gin Glu lie Glu Lys 
3305 3310 3315 3320 

cLo gtg aac gca ggt tea ggc age gee ggc agt ggt aat aat aat ttg 
beu Val Asn Ala Gly Ser Gly Ser Ala Gly Ser Gly Asn Asn Asn Leu 
3325 3330 3335 



30862 



30910 



gca ttt ggc gee ggc acg gae act aaa act tet ttg cag gca tec att 
:Aa Phe Gly Ala Gly Ttir Asp Thr Lys Ihr Ser Leu Gin Ala Ser lie 
3340 3345 3350 

cca tta gcc gat tta aaa att cgt gag gat tac ccg gaa tct att ggc 
Ser Leu Ala Asp Leu Lys He Arg Glu Asp Tyr Pro Glu Ser He Gly 
3355 3360 3365 



30958 



31006 



aaa ate ega cgc ate aaa cag ate age gtt acc ctg ccg gcg eta ttg 31054 
Lys He Arg Arg He Lys Gin He Ser Val Thr Leu Pro Ala Leu Leu 
3370 3375 3380 

gga ect tat cag gat gtg cag gca ata tta tct tac ggc gat aaa gee 31102 
Gly Pro Tyr Gin Asp Val Gin Ala He Leu Ser Tyr Gly Asp Lys Ala 
3385 3390 3395 3400 

gga tta gcg aac ggc tgt gca gcg ctg gcc gtt tec cac ggt acg aat 31150 
Glv Leu Ala Asn Gly Cys Ala Ala Leu Ala Val Ser His Gly Thr Asn 
3405 3410 3415 

gae age ggt caa ttc cag etc gat ttc aac gat ggc aaa ttc ctg ccg 31198 
Asp Ser Gly Gin Phe Gin Leu Asp Phe Asn Asp Gly Lys Phe Leu Pro 
3420 3425 3430 

ttt gaa ggt ate gee att gat caa ggt acg eta aca ctg agt ttt ect 31246 
Phe Glu Gly He Ala He Asp Gin Gly Thr Leu Thr Leu Ser Phe Pro 
3435 3440 3445 



aat gca tea acg cca gcc aaa ggt aaa caa gcc act atg tta aaa acc 
Asn Ala Ser Thr Pro Ala Lys Gly Lys Gin Ala TThr Met Leu Lys Thr 
3450 3455 3460 



31294 



ccg aac gat ate att ttg cat att cgc tac acc att aag taa 
Leu Asn Asp He He Leu His He Arg Tyr Thr He Lys 
3465 3470 3475 



31336 



ceatcccaac acagaactaa gacaggcece gaateggggt etggtaagga gtttet atg 31395 

Met 



cag aat tea cag aca etc age atg acc gag ctg tea tta ect aag ggc 
Gin Asn Ser Gin Thr Phe Ser Met Thr Glu Leu Ser Leu Pro Lys Gly 
3480 3485 3490 3495 



31443 



ggc ggc gee att acc ggt atg ggt gaa gca tta acg ccg gee ggg ccg 
Gly Gly Ala He Thr Gly Met Gly Glu Ala Leu Ttir Pro Ala Gly Pro 
3500 3505 3510 



31491 



gae ggt atg gca gee tta teg ctg cca ttg ccc att tct gcc gga cgt 
Asp Gly Met .Ma Ala Leu Ser Leu Pro Leu Pro He Ser Ala Gly Arg 
3515 3520 3525 



31539 



ggt tat gee ccc teg etc acg ctg aac tac aac age gga ace ggt aac 
Gly Tyr Ala E>ro Ser Leu Thr Leu Asn Tyr Asn Ser Gly Ttir Gly Asn 
3530 3535 3540 



31587 
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age ccg ttc ggt etc ggt tgg gac tgt aac gtc atg aca att cgt cgt 31635 
Ser Pro Phe Gly Gly Trp Asp Cys Asn Val Met Thr He Arg Arq 
3545 3550 3555 

cgc acc agt acc ggc gtg ccg aat tat gat gaa acc gat act ttt ctg 31683 
Arg Thr Ser Ihr Gly Val Pro Asn Tyr Asp Glu Thr Asp Thr Phe Leu 
3560 3565 3570 3575 

ggg ccg gaa ggt gaa gtg ttg gtc gta gca tta aat gag gca ggt caa 31731 
Gly Pro Glu Gly Glu Val Leu Val Val Ala Leu Asn Glu Ala Gly Gin 
3580 3585 3590 

get gat ate cge agt gaa tec tea tta cag ggc ate aat ttg ggg atg 31779 
Ala Asp He Arg Ser Glu Ser Ser Leu Gin Gly He Asn Leu Gly Met 
3595 3600 3605 

ace ttc acc gtt acc ggt tat cgc tec cgt ttg gaa age cac ttt age 31827 
Thr Phe Thr Val Thr Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser 
3610 3615 3620 

egg ttg gaa tac tgg caa ccc caa aca aca ggc gca acc gat ttc tgg 31875 
Arg Leu Glu Tyr Trp Gin Pro Gin Thr Thr Gly Ala Ihr Asp Phe Trp 
3625 3630 3635 

ctg ata tac age ccc gac gga caa gee cat tta ctg ggc aaa aat cct 31923 
Leu He Tyr Ser Pro Asp Gly Gin Ala His Leu Leu Gly Lys Asn Pro 
36^0 3645 3650 3655 

caa gca cgc ate age aat eca eta aat gtt aac caa aca gcg caa tgg 31971 
Gin Ala Arg He Ser Asn Pro Leu Asn Val Asn Gin Thr Ala Gin Trp 
3660 3665 3670 

eta ttg gaa gee teg gta tea tec cac ggc gag cag att tat tat cag 32019 
Leu Leu Glu Ala Ser Val Ser Ser His Gly Glu Gin He Tyr Tyr Gin 
3675 3680 3685 

^t cga gee gaa gat gaa act gat tgc gaa act gac gaa etc aca gee 32067 
lyr Arg Ala Glu Asp Glu Thr Asp Cys Glu Ihr Asp Glu Leu Thr Ala 
3690 3695 3700 

cac ccg aac aca acc gtc cag cgc tac ctg caa gta gta cat tac got 32115 
His Pro Asn Thr Thr Val Gin Arg lyr Leu Gin Val Val His Tyr Glv 
3705 3710 3715 

^ gat gat eca 32163 

Asn I^u Thr Ala Ser Glu Val Phe Pro Thr Leu Asn Gly Asp Lp Pro 
■^'20 3725 3730 3735 

etc aaa tet ggc tgg ttg ttc tgt tta gta ttt gat tac ggt gag cgc 32211 
Leu Lys Ser Gly Trp Leu Phe Cys Leu Val Phe Asp Tyr Gly Glu Arg 
3740 3745 3750 

aaa aac age tta tet gaa atg ccg eca ttt aaa gee aca agt aac tgg 32259 
Lys Asn Ser Leu Ser Glu Met Pro Pro Phe Lys Ala Thr Ser Asn Trp 
3755 3760 3765 

ett tgc cgc aaa gac cgt ttt tec cgt tat gaa tac ggt ttt gca ttg 32307 
Leu Cys Arg Lys Asp Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Ala Leu 
3770 3775 3780 

cge ^c egg cge tta tgt cgc caa ata ctg atg ttt cac cgt ctg caa 32355 
Arg Ttir Arg Arg Cys Arg Gin He Leu Met Phe His Arg Leu Gin 
3785 3790 3795 

acc ctg tet ggt cag gca aaa ggc gac gat gaa ccc gca tta gtt tea 32403 
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"Vhr Leu Ser Gly Gin Ala Lys Giy Asp Asp Glu Pro Ala Leu Val Ser 
3800 3805 3810 3815 

cgt ctg ata ctg gat tat gac gaa aac gcg gtg gtc agt acg etc gtt 32451 
Arg Leu He Leu Asp Tyr Asp Glu Asn Ala Val Val Ser Ihr lieu Val 
3820 3825 3830 

tct gtc cgc cga gtg gga cat gag caa gat ggc aca acg gcg gtc gcc 32499 
Ser Val Arg Arg Val Gly His Glu Gin Asp Gly Ihr Thr Ala Val Ala 
3835 3840 3845 

ctg ccg cca ttg gaa ctg get tat cag cct ttt gaa cca gaa caa aaa 32547 
Leu Pro Pro Leu Glu Leu Ala Tyr Gin Pro Phe Glu Pro Glu Gin Lys 
3850 3855 3860 

gca etc tgg cga cca atg gat gta ctg gcg aat ttc aac ace ate caa 32595 
Ala Leu Trp Arg Pro Met Asp Val Leu Ala Asn Phe Asn Itir He Gin 
3865 3870 3875 

cgc tgg caa ctg ctt gat ctg caa ggc gaa ggc gta ccc ggt att ctg 32643 
Arg Trp Gin Leu Leu Asp Leu Gin Gly Glu Gly Val Pro Gly He Leu 
3880 3885 3890 3895 

tat cag gat aaa aat ggc tgg tgg tat cga tct get caa cgt cag aca 32691 
lyr Gin Asp Lys Asn Gly Trp Trp Tyr Arg Ser Ala Gin Arg Gin Ihr 
3900 3905 3910 

ggg gaa gag atg aat gcg gtc acc tgg ggc aaa atg caa etc ctt cct 32739 
Gly Glu Glu Met Asn Ala Val Thr Trp Gly Lys Met Gin Leu Leu Pro 
3915 3920 3925 

ate acg ccc get att cag gat aac gcc tea ctg atg gat att aat ggt 32787 
He Thr Pro Ala He Gin Asp Asn Ala Ser Leu Met Asp He Asn Gly 
3930 3935 3940 

gat ggg caa ctg gat tgg gtt ate acc ggt ccg ggg eta agg ggt Cat 32835 
Asp Gly Gin Leu Asp Trp Val He Thr Gly Pro Gly Leu Arg Gly Tyr 
3945 3950 3955 

cac age cag cat cca gat ggc agt tgg aca cgt ttt acg ccg ttg cac 32883 
His Ser Gin His Pro Asp Gly Ser Trp Hir Arg Phe Thr Pro Leu His 
3960 3965 3970 3975 

gcc tta ccg ata gaa tat acc cat ccc cgc gee caa ctt gcg gat tta 32931 
Ala Leu Pro He Glu Tyr Thr His Pro Arg Ala Gin Leu Ala Asp Leu 
3980 3985 3990 

atg ggg gcc ggg ctg tec gat tta gtg ctg att ggt ccc aaa age gtg 32979 
Met Gly Ala Gly Leu Ser Asp Leu Val Leu He Gly Pro Lys Ser Val 
3995 4000 4005 

cgt ttg tat gcc aat aac cgt gat ggt ttt acc gaa gga egg gat gtg 33027 
Arg l£u Tyr Ala Asn Asn Arg Asp Gly Phe Thr Glu Gly Arg Asp Val 
4010 4015 4020 

gtg caa tec ggt ggt ate acc ctg ccg tta ccg ggc gee gat gcg cgt 33075 
Val Gin Ser Gly Gly He Thr Leu Pro Leu Pro Gly Ala Asp Ala Arg 
4025 4030 4035 

aag tta gtg gcc ttt age gac gta etc ggt tea ggc caa gca cat ttg 33123 
Lys Val Ala Phe Ser Asp Val Leu Gly Ser Gly Gin Ala His Leu 
4040 4045 4050 4055 

gtt gaa gtt agt gcg acg aaa gtc acc tgc tgg cca aat ctg gga cat 33171 
Val Glu Val Ser Ala Thr Lys Val Thr cys T^ Pro Asn Leu Gly His 
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4060 4065 4070 

ggc cgt ttt ggt cag cca ate aca ttg ccg gga ttt age eaa tec gee 33219 
Gly Arg E^e Gly Gin Pro lie Thr Leu Pro Gly Phe Ser Gin Ser Ala 
4075 4080 4085 

gee aat ttt aat cet gat cga gtt cat etg gee gat ctg gac ggt agt 33267 
Ala Asn Phe Asn Pro Asp Arg Val His Leu Ala Asp Leu Asp Gly Ser 
4090 4095 4100 

ggt cct gcc gat ctg att tat gtt cat get gac cat ctg gat att tte 33315 
Gly Pro Ala Asp Leu lie Tyr Val His Ala Asp His Leu Asp He Phe 
4105 4110 4115 

age aat gaa agt ggt aac ggt ttt gea eaa cca tte aca etc egt ttt 33363 
Ser Asn Glu Ser Gly Asn Gly PhB Ma Gin Pro Phe Thr Leu Arg Phe 
4120 4125 4130 4135 

ect gac ggc etg egt ttt gat gat act tge cag eta eaa gtg get gat 33411 
Pro Asp Gly Leu Arg Phe Asp Asp Thr Cys Gin Leu Gin Val Ala Asp 
4140 4145 4150 

gta eag gga tta ggg gtt gte age etg ate etg age gta eeg eat atg 33459 
Val Gin Gly Leu Gly Val Val Ser Leu He Leu Ser Val Pro His Met 
4155 4160 4165 

geg cca cae eat tgg cgc tgc gat ctg acc aac gcg aaa ccg tgg tta 33507 
Ala Pro His His Trp Arg Cys Asp Leu Thr Asn Ala Lys Pro Trp Leu 
4170 4175 4180 

etc agt gaa atg aac aac aac atg gga gcc cat eac acc ctg cat tac 33555 
Leu Ser Glu Met Asn Asn Asn Met Gly Ala His His Thr Leu His Tyr 
4185 4190 4195 

cgt age tec gtc cag ttt tgg ctg gat gaa aaa gcc gea gcc tta get 33603 
Arg Ser Ser Val Gin Phe Trp Leu Asp Glu Lys Ala Ala Ala Leu Ala 
4200 4205 4210 4215 

acc gga caa aca ccg gtc tgt tac ctg cec tte ccg gte eat ace etg 33651 
Thr Gly Gin Thr Pro Val Cys Tyr Leu Pro Phe Pro Val His Thr Leu 
4220 4225 4230 

tgg eaa aca gaa ace gag gat gaa ate age ggc aat aaa tta gtg ace 33699 
Trp Gin Thr Glu Thr Glu Asp Glu He Ser Gly Asn Lys Leu Val Thr 
4235 4240 4245 

act tta egt tac get cae ggc gcc tgg gat gga cgt gag egg gaa ttt 33747 
rhr Leu Arg Tyr Ala His Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe 
4250 4255 4260 

cgc ggc ttt ggc tat gtt gag eag aca gac age cat caa ctg get caa 33795 
Arg Gly Phe Gly lyr Val Glu Gin Thr Asp Ser His Gin Leu Ala Gin 
4265 4270 4275 

ggc aat gcg ccg gaa cgt aca tea ccg gea cct ace aaa aac tgg tat 33843 
Gly Asn Ala Pro Glu Arg Thr Ser Pro Ala Leu Itir Lys Asn Trp Tyr 
4280 4285 4290 4295 

gee ace gga ate cct gag gta gac aat aeg eta tet gcc ggg tat tgg 33891 
Ala Thr Gly He Pro Glu Val Asp Asn Thr Leu Ser Ala Gly Tyr Trp 
4300 4305 4310 

cgc ggt gat aeg eag get tte act ggt ttt aeg cca cac ttt act etc 33939 
Arg Gly Asp Thr Gin Ala E^e Ttir Gly Phe Thr Pro His Phe Thr Leu 
4315 4320 4325 
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tgg aaa gag ggc aaa gat gtt cca ctg aca ccg gaa gat gac cac aat 
Trp Lys Glu Gly Lys Asp Val Pro Leu Thr Pro Glu Asp Asp His Asn 
4330 4335 4340 

ctg tac tgg tta aac egg gca eta aaa ggt caa cca ctg cgt agt gaa 
Leu Tyr Trp Leu Asn Arg Ala Leu Lys Gly Gin Pro Leu Arg Ser Glu 
4345 4350 4355 

etc tac ggg eta gat ggc age gca cag cag aag ate ccc tat aca gtg 
Leu Tyr Gly Leu Asp Gly Ser Ala Gin Gin Lys lie Pro Tyr Thr Val 
4360 4365 4370 4375 

act gaa tec cgc cca caa gtg cgc caa tta caa gat aac act ace ctt 
Thr Glu Ser Arg Pro Gin Val Arg Gin Leu Gin Asp Asn Thr Thr Leu 
4380 4385 4390 



33987 



34035 



34083 



34131 



tec ecg gtg etc tgg gee tea gtg gtg gaa agt cgt agt tat cac tat 
Ser Pro Val Leu Trp Ala Ser Val Val Glu Ser Arg Ser Tyr His Tyr 
4395 4400 4405 



34179 



gaa cgt ate ate age gat ccc caa tgc aat cag gat ate act ctg tee 
Glu Arg lie lie Ser Asp Pro Gin Cys Asn Gin Asp He Thr Leu Ser 
4410 4415 4420 



34227 



agt gac eta ttc ggg caa ecg ctg aaa cag gtt tea gtg caa tat ccc 
Ser Asp Leu Kie Gly Gin Pro Leu Lys Gin Val Ser Val Gin Tyr Pro 
4425 4430 4435 



34275 



cgc cgc aat aaa cca aca ace aat ccg tat ccc gat aca eta cca gat 34323 
Arg Arg Asn Lys Pro Thr Thr Asn Pro Tyr Pro Asp Thr Leu Pro Asp 
4440 4445 4450 4455 

act ctg ttt gee age agt tat gac gac caa caa caa eta ttg egg tta 34371 
Thr Leu Phe Ala Ser Ser Ty»^ Asp Asp Gin Gin Gin Leu Leu Arg Leu 
4460 4465 4470 



ace tac cag caa tec agt tgg cat cat eta att get aat gaa etc aga 
Thr Tyr Gin Gin Ser Ser Trp His His Leu He Ala Asn Glu Leu Arg 
4475 4480 4485 



34419 



gtg tta gga tta ecg gat ggt aca cgc agt gat get ttc act tac gat 34467 
Val Leu Gly Leu Pro Asp Gly Thr Arg Ser Asp Ala Phe Thr Tyr Asp 
4490 4495 4500 



get aaa cac gtg ect gtt gat ggt tta aat ctg gaa get eta tgt get 
Ala Lys His Val Pro Val Asp Gly Leu Asn Leu Glu Ala Leu Cys Ala 
4505 4510 4515 



34515 



gaa aat age ctg att gcc gat gat aaa ect cgc gaa tac etc aac cag 
Glu Asn Ser Leu He Ala Asp Asp Lys Pro Arg Glu Tyr Leu Asn Gin 
4520 4525 4530 4535 



34563 



caa cga acg ttc tat ace gat ggg aaa ace gat gga aaa aat cca acg 
Gin Arg Thr Phe Tyr Thr Asp Gly Lys Thr Asp Gly Lys Asn Pro Thr 
4540 4545 4550 



34611 



cca ctg aaa aca ccg aca cga cag get tta ate gcc ttt ace gaa acg 
Pro Leu Lys Thr Pro Thr Arg Gin Ala Leu He Ala Phe Thr Glu Thr 
4555 4560 4565 



34659 



geg gta tta acg gaa tec ctg tta tec gca ttt gat ggc ggt ate acg 
Ala Val Leu Thr Glu Ser Leu Leu Ser Ala Phe Asp Gly Gly He Thr 
4570 4575 4580 



34707 
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cca gat gaa tta ccc ggc ctt ctg aca caa gca gga tac caa caa gaa 34755 
Pro Asp Glu Leu Pro Gly Leu Leu Thr Gin Ala Gly Tyr Gin Gin Glu 
4585 4590 4595 

cct tat ctg ttc cca etc agt ggc gaa aac caa gtc tgg gta gca cgc 34803 
Pro TVr Leu Phe Pro Leu Ser Gly Glu Asn Gin Val Trp Val Ala Arg 
4600 4605 4610 4615 

aaa ggc tat acc gat tac gga act gag gta caa ttt tgg cgt cct gtc 34851 
Lys Gly Tyr Thr Asp Tyr Gly Thr Glu Val Gin Phe Trp Arg Pro Val 
4620 4625 4630 

gca caa cgt aac acc cag tta acc ggg aaa acg act eta aaa tgg gat 34899 
Ala Gin Arg Asn Thr Gin Leu Thr Gly Lys Thr Ttvr Leu Lys Trp Asp 
4635 4640 4645 

acc cac tac tgt gtc ate act caa acc caa gae gcg get ggt ttg act 34947 
Thr His lyr Cys Val lie Thr Gin Ihr Gin Asp Ala Ala Gly Leu Thr 
4650 4655 4660 

gtc tea gee aat tat gac tgg cgt ttt etc aca cct atg caa ctg act 34995 
Val Ser Ala Asn Tyr Asp Trp Arg Phe Leu Thr Pro Met Gin Leu Thr 
4665 4670 4675 

gat ate aac gat aat gtg cat ate ata acc ttg gat gcg eta gga cgc 35043 
Asp lie Asn Asp Asn Val His lie He Thr Leu J^sp Ala Leu Gly Arg 
4680 4685 4690 4695 

cct gtc act caa cgt ttc tgg gga ate gaa aat ggt gtg gca aca ggt 35091 
Pro Val Thr Gin Arg Phe Trp Gly He Glu Asn Gly Val Ala Thr Gly 
4700 4705 4710 

tac tct tea cca gaa gca aaa cca ttc act cca cca gtc gat gtc aat 35139 
Tyr Ser Ser Pro Glu Ala Lys Pro Phe Thr Pro Pro Val Asp Val Asn 
4715 4720 4725 

get gee att get ctg ace gga cca etc cct gtc gcg cag tgt ctg gtc 35187 
Ala Ala He Ala Leu Thr Gly Pro Leu Pro Val Ala Gin Cys Leu Val 
4730 4735 4740 

tat gcg ccg gac agt tgg atg ccg eta ttc ggt cag gaa acc ttc aac 35235 
Tyr Ala Pro Asp Ser Trp Met Pro Leu Phe Gly Gin Glu Thr Hie Asn 
4745 4750 4755 

aca tta acg cag gaa gag caa aag aca ctg cgt gat tta egg att ate 35283 
Thr Leu Thr Gin Glu Glu Gin Lys Thr Leu Arg Asp Leu Arg He He 
4760 4765 4770 4775 

aca gaa gat tgg cgt att tgc gca ctg get cgc cgc cgt tgg eta caa 35331 
Thr Glu Asp Trp Arg He Cys Ala Leu Ala Arg Arg Arg Trp Leu Gin 
4780 4785 4790 

agt caa aaa gee ggc aca cca ttg gtt aag ctg tta acc aac age ate 35379 
Ser Gin Lys Ala Gly Thr Pro Leu Val Lys Leu Leu Thr Asn Ser He 
4795 4800 4805 

ggt tta cct ccc cac aac etc atg ctg get acg gac cgt tat gac cgt 35427 
Gly Leu Pro Pro His Asn Leu Met Leu Ala Thr Asp Arg Tyr Asp Arg 
4810 4815 4820 

gat tct gaa cag caa att cgt caa caa gtc gca ttc agt gat ggt ttt 35475 
Asp Ser Glu Gin Gin He Arg Gin Gin Val Ala Phe Ser Asp Gly Phe 
4825 4830 4835 

ggc cgt ttg ttg caa gcg get gtg egg eat gag gca ggc gaa gee tgg 35523 



BNSDOCID: <WO 9942589A2J_> 



wo 99/42589 PCT/EP99/01 01 5 

-52- 



Glv Arg beu Leu Gin Ala Ala Val Arg His Glu Ala Gly Glu Ala Trp 
4840 4845 4850 4855 

caa cgt aac caa gac ggt tct ctg gtg aca aaa atg gaa gat acc aaa 
Gin Arg Asn Gin Asp Gly Ser Leu Val Thr Lys Met Glu Asp Thr Lys 
4860 4865 4870 



35571 



acg cgc tgg gcg att acg gga cgc act gaa tat gac aat aag ggg cag 35619 
Ttir Arg Trp Ala He Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gin 
4875 4880 4885 

gcg ata cga act tat cag ccc tat ttc etc aat gac tgg cga tat gtg 35667 
Ala He Arg Thr Tyr Gin Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val 
4890 4895 4900 

agt gat gac age gcc aga aaa gag gcc tat gcc gat act cat ate tat 35715 
Ser Asp Asp Ser Ala Arg Lys Glu Ala Tyr Ala Asp Thr His He Tyr 
4905 4910 4915 

gat ccg att ggg egg gaa ate caa gtt ate acg gea aaa gge tgg ctg 35763 
Asp Pro He Gly Arg Glu He Gin Val He Thr Ala Lys Gly Trp Leu 
4920 4925 4930 4935 

egg cag aac caa tat ttc ccg tgg ttt acc gtg agt. gaa gat gaa aat 35811 
Arg Gin Asn Gin Tyr Phe Pro Trp Phe Thr Val Ser Glu Asp Glu Asn 
4940 4945 4950 

gat ttg tec get gac gcg etc gtg taa ttgaateaag attcgctcgt 35858 
Asp Leu Ser Ala Asp Ala Leu Val 

4955 4960 

ttaatgttaa cgagcgaata taatatacct aatagattte gagttgcage gcggcggcaa 35918 

gtgaacgaat ccccaggagc atagataact atgtgactgg ggtgagtgaa agcagccaac 35978 

aaagcagcag cttgaaagat gaagggtata aataagaaac tgcattgtga gttctaaata 36038 

gagtagcage atattttatt geettttatt teataggtaa taaaatteaa ttgctgtaaa 36098 

aatctgtcat eatgagaact aaaaataaca actttctett ctgcaagaga aatcaataat 36158 

tcaattaaaa atgttataga atctgaatca agaccatttg ttggctcatc aaaaatataa 36218 

acatccgcat cggtaataaa agetgatgte aatagaaatt tettttttat eccaagtgae 36278 

atatgtccat actcaatace agaataatta gatataecaa aaccatttaa atagtaatct 36338 ' 

aattgatatt ttaaattact tttcctataa cgetgactta aattaatcac atceattcce 36398 

gtgatgaaat tataaaagtt aacattatcc gatagataaa aaccatgctg ttgeaaatta 36458 

aatcggctct tttctccett ttttataaaa ttaaccatte cttttttaac cttatttaca 36518 

ccagcaatae ttgaaagaaa agregtttta cecgecceat taactcccgc aatacggttt 36578 

aateeaaccc gaaaatcaca attgactect gaaaaaatag tcttaceatt aataacaaee 36638 

tetaacccaa taacttcaag cataaataac ccctaaaaat aacgtaaaaa agaaaataac 36698 

accaacaata ataattttcg tgtattgcgt tetcaacaga gaaatagaag aaacaataat 36758 

agaaagaaaa gcataagata aaaatataat cacaggaaaa gatttaacaa caagaaagca 36818 

^ f^j^ aaraaaa aaacaaagca aataaaaaaa caaagaaata ecataattaa aaaagaatat 36878 
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tttccgcaca gataaaaagt tggacaaata tgaaagataa tttatttcaa tatatgatag 36938 
attataaaat aacaacatgc atatatataa aacaacactg gcatatatta atgatatata 36998 
atcagccttg tttgggattt gagaaaaggc actttcacat aatagatata aaagcagaac 37058 
agataatgcc ataataagca cagacatttt atttttatta aaacaataac gaagattcat 37118 
tatataaggc aatgaaaaaa aacctgatga aaataatttt ttatttctat taattatata 37178 
acatggtgtg aaattcaaat ataatatcaa tgctactaat ggaataacta atgtaaaaat 37238 
caaatcatat aatattccac tcctgaatga tgccgccaga agaaagaaca cagcaacaat 37298 
aaaaaaatgc aaaaaactta attcaaataa gcaaaatcca attacagcaa aagaaactat 37358 
caaaaaaaac acagatgaaa ggtaatgcaa ataattaaca ttttcgtaaa aaaaacctat 37418 
aaagaagaaa ataactatcg gaaaagcact ataaataaaa aaaacgatac gactaaaaaa 37478 
caacgttttt ttacctacca aagaaacgat gattgaattc tcctttgcag aaggaaaaaa 37538 
ccttatgtta atcaaataaa ataccatata taccattaaa gatatggcag 37598 
cgattctatg tagccatctg gaataacaat attggaagat aaagttatta aaacctcaaa 37658 
gataccactg aactttgccg gaagtaataa aagaaaaagg aatataatga catttttatt 37718 
cccagacgca aatttcttta tcctaccttt atattccaag gcatcagcga ttattaaatt 37778 
catactgcct ctctaaaacc aaaatctaaa taatgtcctt ggtgaatctt tagggaattt 37838 
cgtcctggaa tgcaaatata aatagttact gaaaacaata cattgatttt taattaaata 37898 
ctggcgatat gaccttaatg atgctacttt attttccagt attcaattcg 37948 

<210> 12 
<211> 954 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 12 

Met Lys Asn He Asp Pro Lys Leu Tyr Gin Lys Thr Pro Val Val Asn 

15 10 15 

He Tyr Asp Asn Arg Gly Leu Thr He Arg Asn He Asp Phe His Arg 
20 25 30 

Thr Thr Ala Asn Gly Asp Thr Asp He Arg He Thr Arg His Gin Tyr 
35 40 45 

Asp Ser Leu Gly His Leu Ser Gin Ser Thr Asp Pro Arg Leu Tyr Glu 
50 55 60 

Ala Lys Gin Lys Ser Asn Phe Leu Trp Gin Tyr Asp Leu Thr Gly Asn 
65 70 75 80 

He Leu Cys Thr Glu Ser Val Asp Ala Gly Arg Thr Val Thr Leu Asn 
85 90 95 

Asp He Glu Gly Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val He 
100 105 110 

Gin Thr Arg Gin Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Lieu Ser 
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115 120 125 

Val Thr Glu Gin He Pro Glu Lys Thr Ser Arg He Itir Glu Arg I^u 
130 135 140 

He Trp Ala Gly Asn Ser Glu Ala Glu Lys Asn His Asn Leu Ala Ser 

145 150 155 160 

GLn Cys Val Arg His lyr Asp Thr Ala Gly Val Thr Arg Leu Glu Ser 
165 170 175 

Leu Ser Leu Thr Gly Thr Val Leu Ser Gin Ser Ser Gin Leu Leu Ser 
180 185 190 

Asp Thr Gin Glu Ala Ser Trp Thr Gly Asp Asn Glu Thr Val Trp Gin 
195 200 205 

Asn Met Leu Ala Asp Asp He Tyr TTir Thr Leu Ser Ala Phe Asp Ala 
210 215 220 

Thr Gly Ala Leu Leu Thr Gin Thr Asp Ala Lys Gly Asn He Gin Arg 
225 230 235 240 

Leu Thr Tyr Asp Val Ala Gly Gin Leu Asn Gly Ser Trp Leu Thr Leu 
245 250 255 

Lys Asp Gin Pro Glu Gin Val He He Arg Ser Leu Thr Tyr Ser Ala 
260 265 270 

Ala Gly Gin Lys Leu Arg Glu Glu His Gly Asn Gly Val He Thr Glu 
275 280 285 

Tyr Ser Tyr Glu Pro Glu Thr Gin Gin Leu He Gly Thr Lys Thr His 
290 295 300 

Arg Pro Ser Asp Ala Lys Val Leu Gin Asp Leu Arg Tyr Glu Tyr Asp 
305 310 315 320 

Pro Val Gly Asn Val He Ser He Arg Asn Asp Ala Glu Ala Thr Arg 
325 330 335 

Phe Trp His Asn Gin Lys Val Ala Pro Glu Asn Tte lyr Itir Tyr Asp 
340 345 350 

Ser Leu Tyr Gin Leu He Ser Ala Thr Gly Arg Glu Met Ala Asn He 
355 360 365 ' 

Gly Gin Gin Ser Asn Gin Leu Pro Ser Leu Thr Leu Pro Ser Asp Asn 
370 375 380 

Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly 
385 390 395 400 

Asn Leu Thr Lys He Gin His Ser Ser Pro Ala Thr Gin Asn Asn Tyr 
405 410 415 

rhr Thr Asn He Tnr Val Ser Asn Arg Ser Asn Arg Ala Val Leu Ser 
420 425 430 

Thr Leu Thr Glu Asp Pro Ala Gin Val Asp Ala Leu Phe Asp Ala Gly 
435 440 445 

Gly His Gin Asn Thr Leu He Ser Gly Gin Asn Leu Asn Trp Asn 'Vhr 
450 455 460 
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Arg Gly Glu Leu Gin His Val Thr Leu Val Lys Arg Asp Lys Gly Ala 
465 470 475 480 

Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser Ser Asp Gly Arg Arg lie 
485 490 495 

Leu Lys lie Asn Glu Gin Gin Thr Ser Ser Asn Ser Gin Thr Gin Arg 
500 505 510 

He Thr Tyr Leu Pro Ser Leu Glu Leu Arg Leu Thr Gin Asn Ser Ttir 
515 520 525 

He Thr Thr Glu Asp Leu Gin Val He Thr Val Gly Glu Ala Gly Arg 
530 535 540 

Ala Gin Val Arg Val Leu His Trp Asp Ser Gly Gin Pro Glu Asp He 
545 550 555 560 

Asp Asn Asn Gin Leu Arg Tyr Ser Tyr Asp Asn Leu He Gly Ser Ser 
565 570 575 

Gin Leu Glu Leu Asp Ser Lys Gly Giu He He Ser Glu Glu Glu Tyr 
580 585 590 

Tyr Pro Tyr Gly Gly Ihr Ala Leu Trp Ala Thr Arg Lys Arg Thr Glu 

595 600 605 

Ala Ser Tyr Lys Thr He Arg lyr Ser Gly Lys Glu Arg Asp Ala Thr 
610 615 620 

Gly Leu Tyr Tyr lyr Gly Tyr Arg Tyr Tyr Gin Pro Trp Val Gly Arg 
625 630 635 640 

Trp Leu Ser Ala Asp Pro Ala Gly Thr Val Asp Gly Leu Asn Leu Oyr 
645 650 655 

Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro Asp Gly Leu 
660 665 670 

Met Pro Thr He Ala Glu Arg He Ala Ala Leu Gin Lys Asn Lys Val 
675 680 685 

Ala Asp Ser Ala Pro Ser Pro Thr Asn Ala Thr Asn Val Ala He Asn 
690 695 700 

He Arg Pro Pro Val Ala Pro Lys Pro Thr Leu Pro Lys Ala Ser Thr 

705 710 715 720 

Ser Ser Gin Ser Thr Thr Tyr Pro He Lys Ser Ala Ser He Lys Pro 
725 730 735 

Thr Thr Ser Gly Ser Ser He Thr Ala Pro Leu Ser Pro Val Gly Asn 
740 745 750 

Lys Ser Thr Pro Glu He Ser Leu Pro Glu Ser Thr Gin Ser Asn Ser 
755 760 765 

Ser Ser Ala He Ser Thr Asn Leu Gin Lys Lys Ser Phe Thr Leu Tyr 
770 775 780 

Arg Ala Asp Asn Arg Ser Phe Glu Asp Met Gin Ser Lys Phe Pro Glu 
785 790 795 800 

Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gin Phe 
805 810 815 
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Ala Ser Val Phe lie Gly Gin Lys Asp Thr Ser Asn Leu Pro Lys Glu 
820 825 830 

Thr Val Lys Asn lie Asn Thr Trp Gly Thr Lys Pro Lys Leu Asn Asp 
835 840 845 

Leu Ser Thr Tyr lie Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val 
850 855 860 

Ser Thr Ala He Asn Thr Glu Ala Gly Gly Gin Ser Ser Gly Ala Pro 
865 870 875 880 

Leu His Glu He Asn Met Asp Leu Tyr Glu Phe Thr He Asp Gly Gin 
885 890 895 

Lys Leu Asn Pro Leu Pro Arg Gly Arg Ser Lys Asp Arg Val Pro Ser 
900 905 910 

Leu Leu Leu Asp Thr Pro Glu He Glu Thr Ala Ser He He Ala Leu 
915 920 925 

Asn His Gly Pro Val Asn Asp Ala Glu Val Ser Phe Leu Thr Itir He 
930 935 940 

Pro Leu Lys Asn Val Lys Pro Tyr Lys Arg 
945 950 



<210> 13 
<211> 2522 
<212> PRT 

<213> EtiotX)rhabdus luminescens 
<400> 13 

Met He Leu Lys Gly He Asn Met Asn Ser Pro Val Lys Glu He Pro 
15 10 15 

Asp Val Leu Lys He Gin Cys Gly Phe Gin Cys Leu Thr Asp He Ser 
20 25 30 

His Ser Ser Phe Asn Glu Phe His Gin Gin Val Ser Glu His Leu Ser 
35 40 45 

Trp Ser Glu Ala His Asp Leu Tyr His Asp Ala Gin Gin Ala Gin Lys 
50 55 60 

Asp Asn Arg Leu lyr Glu Ala Arg He Leu Lys Arg Thr Asn Pro Gin 
65 70 75 80 

Leu Gin Asn Ala Val His Leu Ala He Val Ala Pro Asn Ala Glu Leu 
85 90 95 

He Gly Tyr Asn Asn Gin Phe Ser Gly Arg Ala Ser Gin lyr Val Ala 
100 105 110 

Pro Gly Thr Val Ser Ser Met Phe Ser Pro Ala Ala Tyr Leu Thr Glu 
115 120 125 

Leu Tyr Arg Glu Ala Arg Asn Leu His Ala Ser Asp Ser Val Tyr Arg 
130 135 140 

Leu Asp Thr Arg Arg Pro Asp Leu Lys Ser Met Ala Leu Ser Gin Gin 
145 150 155 160 
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Asn Met Asp Ttir Glu Leu Ser Thxr Leu Ser Leu Ser Asn Glu Leu Leu 
165 170 175 

Leu Glu Ser lie Lys Thr Glu Ser Lys Leu Asp Asn Tyr Utrr Gin Val 
180 185 190 

Met Glu Met Leu Ser Ala Phe Arg Pro Ser Gly Ala Thr Pro Tyr His 
195 200 205 

Asp Ala Tyr Glu Asn Val Arg Lys Val lie Gin Leu Gin Asp Pro Gly 
210 215 220 

Leu Glu Gin Leu Asn Ala Ser Pro Ala lie Ala Gly Leu Met His Gin 
225 230 235 240 

Ala Ser Leu Leu Gly lie Asn Ala Ser lie Ser Pro Glu Leu Phe Asn 
245 250 255 

lie Leu Thr Glu Glu lie Thr Glu Gly Asn Ala Glu Glu Leu Tyr Lys 
260 265 270 

Lys Asn Phe Gly Asn lie Glu Pro Ala Ser Leu Ala Met Pro Glu Tyr 
275 280 285 

Leu Arg Arg Tyr Tyr Asn Leu Ser Asp Glu Glu Leu Ser Gin Ph^ He 
290 295 300 

Gly Lys Ala Ser Asn Phe Gly Gin Gin Glu Tyr Ser Asn Asn Gin Leu 
305 310 315 320 

He Thr Pro He Val Asn Ser Asn Asp Gly Thr Val Lys Val Tyr Arg 
325 330 335 

He Thr Arg Glu Tyr Thr Thr Asn Ala Asn Gin Val Asp Val Glu Leu 
340 345 350 

Phe Pro Tyr Gly Gly Glu Asn Tyr Gin Leu Asn Tyr Lys Phe Lys Asp 
355 360 365 

Ser Arg Gin Asp Val Ser Tyr Leu Ser He Lys Leu Asn Asp Lys Arg 
370 375 380 

Glu Leu He Arg He Glu Gly Ala Pro Gin Val Asn He Glu Tyr Ser 
385 390 395 400 

Glu His He Ohr Leu Ser Thr Thr Asp He Ser Gin Pro Phe Glu He 
405 410 415 

Gly Leu Thr Arg Val Tyr Pro Ser Ser Ser Trp Ala Tyr Ala Ala Ala 
420 425 430 

Lys Phe Thr He Glu Glu Tyr Asn Gin Tyr Ser Phe Leu Leu Lys Leu 
435 440 445 

Asn Lys Ala He Arg Leu Ser Arg Ala Thr Glu Leu Ser Pro Thr He 
450 455 460 

Leu Glu Ser He Val Arg Ser Val Asn Gin Gin* Leu Asp He Asn Ala 
465 470 475 480 

Glu Val Leu Gly Lys Val Phe Leu Ihr Lys Tyr lyr Met Gin Arg Tyr 
485 490 495 

Ala He Asn Ala Glu Thr Ala Leu He Leu Cys Asn Ala Leu He Ser 
500 505 510 
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Gln Arg Ser Tyr Asp Asn Gin Pro Ser Gin E^e Asp Arg Leu Phe Asn 

515 520 525 

Thr Pro Leu Leu Asn Gly Gin Tyr Phe Ser Thr Gly Asp Glu Glu lie 
530 535 540 

Asp Leu Asn Pro Gly Ser Thr Gly Asp Trp Arg Lys Ser Val Leu Lys 
545 550 555 560 

Arg Ala Phe Asn lie Asp Asp lie Ser Leu Tyr Arg Leu Leu Lys lie 
565 570 575 

IVir Asn His Asn Asn Gin Asp Gly Lys He Lys Asn Asn Leu Asn Asn 
580 585 590 

Leu Ser Asp Leu Tyr He Gly Lys Leu Leu Ala Glu He His Gin Leu 

595 600 605 

Itir He Asp Glu Leu Asp Leu Leu Leu Val Ala Val Gly Glu Gly Glu 
610 615 620 

'Vhr Asn Leu Ser Ala He Ser Asp Lys Gin Leu Ala Ala Leu He Arg 
625 630 635 640 

Lys Leu Asn Thr He Thr Val Trp Leu Gin Thr Gin Lys Trp Ser Ala 
645 650 655 

E^e Gin Leu Phe Val Met Thr Ser Thr Ser Tyr Asn Lys Thr Leu Thr 
660 665 670 

Pro Glu He Lys Asn Leu Leu Asp Ito Val Tyr His Gly Leu Gin Gly 
675 680 685 

Phe Asp Lys Asp Lys Ala Asn Leu Leu His Val Met Ala Pro Tyr He 
690 695 700 

Ala Ala Thr Leu Gin Leu Ser Ser Glu Asn Val Ala His Ser Val Leu 
705 710 715 720 

Leu Trp Ala Asp Lys Leu Lys Pro Gly Asp Gly Ala Met Thr Ala Glu 
725 730 735 

Lys Phe Trp Asp Trp Leu Asn Ihr Gin Tyr Ihr Pro Asp Ser Ser Glu 
740 745 750 

Val Leu Ala Thr Gin Glu His He Val Gin Tyr Cys Gin Ala Leu Ala 
755 760 765 

Gin Leu Glu Met Val Tyr His Ser Thr Gly He Asn Glu Asn Ala Phe 
770 775 780 

Arg Leu Phe Val Thr Lys Pro Glu Met Phe Gly Ser Ser Thr Glu Ala 
785 790 795 800 

Val Pro Ala His Asp Ala Leu Ser Leu He Met Leu Thr Arg me Ala 
805 810 815 

Asp Trp Val Asn Ala Leu Gly Glu Lys Ala Ser Ser Val Leu Ala Ala 
820 825 830 

Phe Glu Ala Asn Ser Leu Thr Ala Glu Gin Leu Ala Asp Ala Met Asn 
835 840 845 

Leu Asp Ala Asn Leu Leu Leu Gin Ala Ser Thr Gin Ala Gin Asn His 
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850 855 860 

Gin His Leu Pro Pro Val Thr Gin Lys Asn Ala Phe Ser Cys Trp Thr 
865 870 875 880 

Ser lie Asp Thr lie Leu Gin Trp Val Asn Val Ala Gin Gin Leu Asn 
885 890 895 

Val Ala Pro Gin Gly Val Ser Ala Leu Val Gly Leu Asp Tyr lie Gin 
900 905 910 

Leu Asn Gin Lys lie Pro Thr lyr Ala Gin Trp Glu Ser Ala Gly Glu 
915 920 925 

lie Leu Thr Ala Gly Leu Asn Ser Gin Gin Ala Asp lie Leu His Ala 

930 935 940 

Phe Leu Asp Glu Ser Arg Ser Ala Ala Leu Ser Thr Tyr Tyr lie Arg 
945 950 955 960 

Gin Val Ala Lys Pro Ala Ala Ala lie Lys Ser Arg Asp Asp Leu Tyr 
965 970 975 

Gin Tyr Laa Leu lie Asp Asn Gin Val Ser Ala Ala lie Lys Thr Thr 
980 985 990 

Arg lie Ala Glu Ala lie Ala Ser lie Gin Leu Tyr Val Asn Arg Thr 
995 1000 1005 

Leu Glu Asn Val Glu Glu Asn Ala His Ser Gly Val lie Ser Arg Gin 
1010 1015 1020 

Phe Phe He Asp Trp Asp Lys Tyr Asn Lys Arg Tyr Ser Thr Trp Ala 
025 1030 1035 1040 

Gly Val Ser Gin Leu Val Tyr Tyr Pro Glu Asn Tyr He Asp Pro Thr 
1045 1050 1055 

Met Arg He Gly Gin Thr Lys Met Met Asp Ala Leu Leu Gin Ser Val 
1060 1065 1070 

Ser Gin Ser Gin Leu Asn Ala Asp Thr Val Glu Asp Ala Phe Met Ser 
1075 1080 1085 

Tyr Leu Thr Ser Phe Glu Gin Val Ala Asn Leu Lys Val He Ser Ala 
1090 1095 1100 

Tyr His Asp Asn He Asn Asn Asp Gin Gly Leu Thr Tyr Phe He Gly 
105 1110 1115 1120 

Leu Ser Glu Thr Asp Thr Gly Glu Tyr Tyr Trp Arg Ser Val Asp His 
1125 1130 1135 

Ser Lys Phe Ser Asp Gly Lys Phe Ala Ala Asn Ala Trp Ser Glu Trp 
1140 1145 1150 

His Lys He Asp Cys Pro He Asn Pro Tyr Arg Ser Thr He Arg Pro 
1155 1160 116S 

Val Met Tyr Lys Ser Arg Leu Tyr Leu Leu Trp Leu Glu Gin Lys Glu 
1170 1175 1180 

He Thr Lys Gin Thr Gly Asn Ser Lys Asp Gly Tyr Gin Thr Glu Thr 
185 1190 1195 1200 
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Asp lyr Arg Tyr Glu Leu Lys Leu Ala His lie Arg Tyr Asp Gly Thr 
1205 1210 1215 

Trp Asn Thr Pro lie Hir Phe Asp Val Asn Glu Lys lie Ser Lys Leu 
1220 1225 1230 

Glu Leu Ala Lys Asn Lys Ala Pro Gly Leu Tyr Cys Ala Gly Tyr Gin 
1235 1240 1245 

Gly Glu Asp Ihr Leu Leu Val Met Phe Tyr Asn Gin Gin Asp Thr Leu 
1250 1255 1260 

Asp Ser Tyr Lys Itir Ala Ser Met Gin Gly Leu Tyr He Phe Ala Asp 
265 1270 1275 1280 

Met Glu Tyr Lys Asp Met Thr Asp Gly Gin Tyr Lys Ser Tyr Arg Asp 
1285 1290 1295 

Asn Ser Tyr Lys Gin Phe Asp Thr Asn Ser Val Arg Arg Val Asn Asn 
1300 1305 1310 

Arg Tyr Ala Glu Asp Tyr Glu He Pro Ser Ser Val Asn Ser Arg Lys 
1315 1320 1325 

Gly Tyr Asp Trp Gly Asp Tyr Tyr Leu Ser Met Val Tyr Asn Gly Asp 
1330 1335 1340 

He Pro Thr He Ser Tyr Lys Ala Thr Ser Ser Asp Leu Lys He Tyr 
345 1350 1355 1360 

He Ser Pro Lys Leu Arg He He His Asn Gly Tyr Glu Gly Gin Gin 
1365 1370 1375 

Arg Asn Gin Cys Asn Leu Met Asn Lys Tyr Gly Lys Leu Gly Asp Lys 
1380 1385 1390 

PhB He Val Tyr Thr Ser Leu Gly Val Asn Pro Asn Asn Ser Ser Asn 
1395 1400 1405 

Lys Leu Met Phe Tyr Pro Val Tyr Gin Tyr Asn Gly Asn Val Ser Gly 
1410 1415 1420 

L^u Ser Gin Gly Arg Leu Leu Phe His Arg Asp Thr Asn Tyr Ser Ser 
425 1430 1435 1440 

Lys Val Glu Ala Trp He Pro Gly Ala Gly Arg Ser Leu Ttir Asn Pro 
1445 1450 1455 

Asn Ala Ala He Gly Asp Asp Tyr Ala Thr Asp Ser Leu Asn Lys Pro 
1460 1465 1470 

Asn Asp Leu Lys Gin Tyr Val Tyr Met Thr Asp Ser Lys Gly Thr Ala 
1475 1480 1485 

Thr Asp Val Ser Gly Pro Val Asp He Asn Thr Ala He Ser Pro Ala 
1490 1495 1500 

Lys Val Gin Val Thr Val Lys Ala Gly Ser Lys Glu Gin Thr Phe Thr 
505 1510 1515 1520 

Ala Asp Lys Asn Val Ser He Gin Pro Ser Pro Ser Phe Asp Glu Met 
1525 1530 1535 

Asn Tyr Gin Phe Asn Ala Leu Glu He Asp Gly Ser Ser Leu Asn Phe 
1540 1545 1550 
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Thr Asn Asn Ser Ala Ser lie Asp lie Itir Phe Ihr Ala Phe Ala Glu 
1555 1560 1565 

Asp Gly Arg Lys Leu Gly Tyr Glu Ser Phe Ser lie Pro He Thr Arg 
1570 1575 1580 

Lys Val Ser Thr Asp Asn Ser Leu Thr Leu Arg His Asn Glu Asn Gly 
585 1590 1595 1600 

Ala Gin Tyr Met Gin Trp Gly Val Tyr Arg He Arg Leu Asn Ihr Leu 
1605 1610 1615 

Phe Ala Arg Gin Leu Val Ala Arg Ala Thr Thr Gly He Asp Ihr He 
1620 1625 1630 

Leu Ser Met Glu Thr Gin Asn He Gin Glu Pro Gin Leu Gly Lys Gly 
1635 1640 1645 

Phe Tyr Ala Ttir Phe Val He Pro Pro Tyr Asn Pro Ser Thr His Gly 
1650 1655 1660 

Asp Glu Arg Trp Phe Lys Leu Tyr He Lys His Val Val Asp Asn Asn 
665 1670 1675 1680 

Ser His He He Tyr Ser Gly Gin Leu Lys Asp Thr Asn He Ser Thr 
1685 1690 1695 

Hir Leu Phe He Pro Leu Asp Asp Val Pro Leu Asn Gin Asp Tyr Ser 
1700 1705 1710 

Ala Lys Val Tyr Met Thr Phe Lys Lys Ser Pro Ser Asp Gly Tbr Trp 
1715 1720 1725 

Trp Gly Pro His Phe Val Arg Asp Asp Lys Gly He Val Thr He Asn 
1730 1735 1740 

Pro Lys Ser He Leu Thr His Phe Glu Ser Val Asn Val Leu Asn Asn 
745 1750 1755 1760 

He Ser Ser Glu Pro Met Asp Phe Ser Gly Ala Asn Ser Leu Tyr Phe 
1765 1770 1775 

Trp Glu Leu Phe Tyr Tyr Thr Pro Met Leu Val Ala Gin Arg Leu Leu 
1780 1785 1790 

His Glu Gin Asn Phe Asp Glu Ala Asn Arg Trp Leu Lys Tyr Val Trp 
1795 1800 1805 

Ser Pro Ser Gly Tyr He Val His Gly Gin He Gin Asn Tyr Gin Trp 
1810 1815 1820 

Asn Val Arg Pro Leu Leu Glu Asp Thr Ser Trp Asn Ser Asp Pro Leu 
825 1830 1835 1840 

Asp Ser Val Asp Pro Asp Ala Val Ala Gin His Asp Pro Met His Tyr 
1845 1850 1855 

Lys Val Ser Thr Phe Met Arg Thr Leu Asp Leu Leu He Ala Arg Gly 
1860 1865 1870 

Asp His Ala Tyr Arg Gin Leu Glu Arg Asp Thr Leu Asn Glu Ala Lys 
1875 1880 1885 

Met Trp lyr Met Gin Ala Leu His Leu Leu Gly Asp Lys Pro Tyr Leu 
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1890 1895 1900 

Pro Leu Ser Thr Thr Trp Asn Asp Pro Arg Leu Asp Lys Ala Ala Asp 
905 1910 1915 1920 

lie Thr Ttir Gin Ser Ala His Ser Ser Ser lie Val Ala Leu Arg Gin 
1925 1930 1935 

Ser Thr Pro Ala Leu Leu Ser Leu Arg Ser Ala Asn Thr Leu Thr Asp 
1940 1945 1950 

Leu Phe Leu Pro Gin lie Asn Glu Val Met Met Asn Tyr Trp Gin Thr 
1955 I960 1965 

Leu Ala Gin Arg Val Tyr Asn Leu Arg His Asn I^u Ser He Asp Gly 
1970 1975 1980 

Gin Pro Leu Tyr Leu Pro He Tyr Ala Thr Pro Ala Asp Pro Lys Ala 
985 1990 1995 2000 

Leu Leu Ser Ala Ala Val Ma Ttir Ser Glxi Gly Gly Gly Lys Leu Pro 
2005 2010 2015 

Glu Ser Phe Met Ser Leu Trp Arg Phe Pro His Met leu Glu Asn Ala 
2020 2025 2030 

Arg Ser Met Val Ser Gin Leu Thr Gin Phe Gly Ser Ihr Leu Gin Asn 
2035 2040 2045 

He He Glu Arg Gin Asp Ala Glu Ala Leu Asn Ala Leu Leu Gin Asn 
2050 2055 2060 

Gin Ala Ala Glu Leu He Leu Thr Asn Leu Ser He Gin Asp Lys Thr 
065 2070 2075 2080 

He Glu Glu Leu Asp Ala Glu Lys Thr Val Leu Glu Lys Ser Lys Ala 
2085 2090 2095 

Gly Ala Gin Ser Arg E*ie Asp Ser Tyr Ser Lys Leu His Asp Glu Asn 
2100 2105 2110 

He Asn Ala Gly Glu Asn Gin Ala Met Thr Leu Arg Ala Ser Ala Ala 
2115 2120 2125 

Gly Leu Thr Thr Ala Val Gin Ala Ser Arg Leu Ala Gly Ala Ala Ala 
2130 2135 2140 

Asp Leu Val Pro Asn He Phe Gly Phe Ala Gly Gly Gly Ser Arg Trp 
145 2150 2155 2160 

Gly Ala He Ala Glu Ala Thr Gly Tyr Val Met Glu Phe Ser Ala Asn 
2165 2170 2175 

Val Met Asn Thr Glu Ala Asp Lys He Ser Gin Ser Glu Thr Tyr Arg 
2180 2185 2190 

Arg Arg Arg Gin Glu Trp Glu He Gin Arg Asn Asn Ala Glu Ala Glu 
2195 2200 2205 

Leu Lys Gin Leu Asp Ala Gin Leu Lys Ser Leu Ala Val Arg Arg Glu 
2210 2215 2220 

Ala Ala Val Leu Gin Lys Thr Ser Leu Lys Thr Gin Gin Glu Gin Thr 
225 2230 2235 2240 
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Gin Ala Gin lieu Ala Phe Leu Gin Arg Lys Phe Ser Asn Gin Ala Leu 
2245 2250 2255 

Tyr Asn Trp Leu Arg Gly Arg Leu Ala Ala lie Tyr Phe Gin Phe Tyr 

2260 2265 2270 

Asp Leu Ala lie Ala Arg Cys Ijeu Met Ala Glu Gin Ala Tyr Arg Trp 
2275 2280 2285 

Glu lie Ser Asp Asp Ser Ala Arg Phe lie Lys Pro Gly Ala Trp Gin 
2290 2295 2300 

Gly Ihr Tyr Ala Gly Leu Leu Ala Gly Glu Thr Leu Met Leu Ser Leu 
305 2310 2315 2320 

Ala Gin Met Glu Asp Ala His Leu Arg Arg Asp Lys Arg Ala Leu Glu 
2325 2330 2335 

Val Glu Arg Ihr Val Ser Leu Ala Glu lie Tyr Ala Gly Leu Pro Gin 
2340 2345 2350 

Asp Lys Gly Pro Phe Ser Leu Thr Gin Glu lie Glu Lys Leu Val Asn 
2355 2360 2365 

Ala Gly Ser Gly Ser Ala Gly Ser Gly Asn Asn Asn Leu Ala Phe Gly 
2370 2375 2380 

Ala Gly Thr Asp Thr Lys Thr Ser Leu Gin Ala Ser lie Ser Leu Ala 
385 2390 2395 2400 

Asp Leu Lys lie Arg Glu Asp Tyr Pro Glu Ser lie Gly Lys lie Arg 
2405 2410 2415 

Arg lie Lys Gin lie Ser Val Thr Leu Pro Ala Leu Leu Gly Pro Tyr 
2420 2425 2430 

Gin Asp Val Gin Ala lie Leu Ser Tyr Gly Asp Lys Ala Gly Leu Ala 
2435 2440 2445 

Asn Gly Cys Ala Ala Leu Ala Val Ser His Gly Thr Asn Asp Ser Gly 
2450 2455 2460 

Gin Phe Gin Leu Asp Phe Asn Asp Gly Lys Phe Leu Pro Phe Glu Gly 
465 2470 2475 2480 

He Ala He Asp Gin Gly Thr Leu Ihr Leu Ser Phe Pro Asn Ala Ser 
2485 2490 2495 

Thr Pro Ala Lys Gly Lys Gin Ala Thr Met Leu Lys Thr Leu Asn Asp 
2500 2505 2510 

He He Leu His He Arg Tyr Thr He Lys 
2515 2520 



<210> 14 
<211> 1481 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 14 

Met Gin Asn Ser Gin Ttir Phe Ser Met Thr Glu Leu Ser Leu Pro Lys 
15 10 15 

Gly Gly Gly Ala He Ihr Gly Met Gly Glu Ala Leu Thr Pro Ala Gly 
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20 



25 



30 



Pro Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro He Ser Ala Gly 
35 40 45 

Arg Gly lyr Ala Pro Ser Leu Ihr Leu Asn Tyr Asn Ser Gly Thr Gly 
50 55 60 

Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr He Arg 
65 70 75 80 

Arg Arg Thr Ser Thr Gly Val Pro Asn Tyr Asp Glu Thr Asp Thr £*ie 
85 90 95 

Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Leu Asn Glu Ala Gly 
100 105 110 

Gin Ala Asp He Arg Ser Glu Ser Ser Leu Gin Gly He Asn Leu Gly 
115 120 125 

Met Thr Phe Thr Val Thr Gly Tyr Arg Ser Arg Leu Glu Ser His Phe 
130 135 140 

Ser Arg Leu Glu Tyr Trp Gin Pro Gin 'Vhr Thr Gly Ala Thr Asp I%ie 
145 150 155 160 

Trp Leu He Tyr Ser Pro Asp Gly Gin Ala His Leu Leu Gly Lys Asn 
165 170 175 

Pro Gin Ala Arg He Ser Asn Pro Leu Asn Val Asn Gin Thr Ala Gin 
180 185 190 

Trp Leu Leu Glu Ala Ser Val Ser Ser His Gly Glu Gin He Tyr Tyr 
195 200 205 

Gin Tyr Arg Ala Glu Asp Glu Thr Asp Cys Glu Thr Asp Glu Leu Thr 
210 215 220 

Ala His Pro Asn Thr Thr Val Gin Arg Tyr Leu Gin Val Val His Tyr 
225 230 235 240 

Gly Asn Leu Thr Ala Ser Glu Val Phe Pro Thr Leu Asn Gly Asp Asp 
245 250 255 

Pro Leu Lys Ser Gly Trp Leu Fhe Cys Leu Val Phe Asp Tyr Gly Glu 
260 265 270 

Arg Lys Asn Ser Leu Ser Glu Met Pro Pro Phe Lys Ala Thr Ser Asn 

275 280 285 

Trp Leu Cys Arg Lys Asp Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Ala 
290 295 300 

Leu Arg Thr Arg Arg Leu Cys Arg Gin He Leu Met Phe His Arg Leu 
305 310 315 320 

Gin Thr Leu Ser Gly Gin Ala Lys Gly Asp Asp Glu Pro Ala Leu Val 
325 330 335 

Ser Arg Leu He Leu Asp Tyr Asp Glu Asn Ala Val Val Ser Thr Leu 
340 345 350 



Val Ser Val Arg Arg Val Gly His Glu Gin Asp Gly Thr Thr Ala Val 
355 360 365 
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Ala Leu Pro Pro Leu Glu Leu Ala Tyr Gin Pro Phe Glu Pro Glu Gin 
370 375 380 

Lys Ala Leu Trp Arg Pro Met Asp Val Leu Ala Asn Phe Asn "nir lie 
385 390 395 400 

Gin Arg Trp Gin Leu Leu Asp Leu Gin Gly Glu Gly Val Pro Gly lie 
405 410 415 

Leu Tyr Gin Asp Lys Asn Gly Trp Trp Tyr Arg Ser Ala Gin Arg Gin 
420 425 430 

Thr Gly Glu Glu Met Asn Ala Val Thr Trp Gly Lys Met Gin Leu Leu 
435 440 445 

Pro lie Thr Pro Ala lie Gin Asp Asn Ala Ser Leu Met Asp He Asn 
450 455 460 

Gly Asp Gly Gin Leu Asp Trp Val He Thr Gly Pro Gly Leu Arg Gly 
465 470 475 480 

Tyr His Ser Gin His Pro Asp Gly Ser Trp Thr Arg Phe Thr Pro Leu 
485 490 495 

His Ala Leu Pro He Glu Tyr Thr His Pro Arg Ala Gin Leu Ala Asp 
500 505 510 

Leu Met Gly Ala Gly Leu Ser Asp Leu Val Leu He Gly Pro Lys Ser 
515 520 525 

Val Arg Leu Tyr Ala Asn Asn Arg Asp Gly Phe Thr Glu Gly Arg Asp 

530 535 540 

Val Val Gin Ser Gly Gly He Thr Leu Pro Leu Pro Gly Ala Asp Ala 
545 550 555 560 

Arg Lys Leu Val Ala Phe Ser Asp Veil Leu Gly Ser Gly Gin Ala His 
565 570 575 

Leu Val Glu Val Ser Ala Thr Lys Val Thr Cys Trp Pro Asn Leu Gly 
580 585 590 

His Gly Arg Phe Gly Gin Pro He Thr Leu Pro Gly Phe Ser Gin Ser 
595 600 605 

Ala Ala Asn Phe Asn Pro Asp Arg Veil His Leu Ala Asp Leu Asp Gly 
610 615 620 

Ser Gly Pro Ala Asp Leu He Ty^r Val His Ala Asp His Leu Asp He 
625 630 635 640 

Phe Ser Asn Glu Ser Gly Asn Gly Phe Ala Gin Pro Phe Thr Leu Arg 
645 650 655 

Phe Pro Asp Gly Leu Arg Phe Asp Asp Thr Cys Gin Leu Gin Val Ala 
660 665 670 

Asp Val Gin Gly Leu Gly Val Val Ser Leu He Leu Ser Val Pro His 
675 680 685 

Met Ala Pro His His Trp Arg Cys Asp Leu Thr Asn Ala Lys Pro Trp 
690 695 700 

Leu Leu Ser Glu Met Asn Asn Asn Met Gly Ala His His Thr Leu His 
705 710 715 720 
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Tyr Arg Ser Ser Val Gin Phe Trp Leu Asp Glu Lys Ala Ala Ala Leu 
725 730 735 

Ala Thr Gly Gin Ihr Pro Val Cys Tyr Leu Pro Phe Pro Val His Thr 
740 745 750 

Leu Trp Gin 'Thr Glu Thr Glu Asp Glu lie Ser Gly Asn Lys Leu Val 
755 760 765 

Thr Thr Leu Arg Tyr Ala His Gly Ala Trp Asp Gly Arg Glu Arg Glu 
770 775 780 

Phe Arg Gly Phe Gly Tyr Val Glu Gin Thr Asp Ser His Gin Leu Ala 
785 790 795 800 

Gin Gly Asn Ala Pro Glu Arg Thr Ser Pro Ala Leu Thr Lys Asn Trp 
805 810 815 

Tyr Ala Ihr Gly lie Pro Glu Val Asp Asn Thr Leu Ser Ala Gly Tyr 
820 825 830 

Trp Arg Gly Asp Ihr Gin Ala Phe Thr Gly Phe Ihr Pro His Phe 'Thr 
835 840 845 

Leu Trp Lys Glu Gly Lys Asp Val Pro Leu Hir Pro Glu Asp Asp His 
850 855 860 

Asn Leu Tyr Trp Leu Asn Arg Ala Leu Lys Gly Gin Pro Leu Arg Ser 
865 870 875 880 

Glu Leu Tyr Gly Leu Asp Gly Ser Ala Gin Gin Lys He Pro Tyr Thr 
885 890 895 

Val Ihr Glu Ser Arg Pro Gin Val Arg Gin Leu Gin Asp Asn Thr Thr 
900 905 910 

Leu Ser Pro Val Leu Trp Ala Ser Val Val Glu Ser Arg Ser Tyr His 
915 920 925 

Tyr Glu Arg He He Ser Asp Pro Gin Cys Asn Gin Asp He Thr Leu 
930 935 940 

Ser Ser Asp Leu Phe Gly Gin Pro Leu Lys Gin Val Ser Val Gin Tyr 
945 950 955 960 

Pro Arg Arg Asn Lys Pro Thr Thr Asn Pro Tyr Pro Asp Thr Leu Pro 
965 970 975 

Asp Thr Leu Phe Ala Ser Ser Tyr Asp Asp Gin Gin Gin Leu Leu Arg 
980 985 990 

Leu Thr Tyr Gin Gin Ser Ser Trp His His Leu He Ala Asn Glu Leu 
995 1000 1005 

Arg Val Leu Gly Leu Pro Asp Gly Thr Arg Ser Asp Ala Phe Thr Tyr 
1010 1015 1020 

Asp Ala Lys His Val Pro Val Asp Gly Leu Asn Leu Glu Ala Leu Cys 
025 1030 1035 1040 

Ala Glu Asn Ser Leu He Ala Asp Asp Lys Pro Arg Glu Tyr Leu Asn 
1045 1050 1055 

Gin Gin Arg Thr Phe Tyr Thr Asp Gly Lys Thr Asp Gly Lys Asn Pro 
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1060 1065 1070 

Thr Pro Leu Lys Thr Pro Thr Arg Gin Ala Leu lie Ala Phe Thr Glu 
1075 1080 1085 

Itir Ala Val Leu Tlir Glu Ser Leu Leu Ser Ala Phe Asp Gly Gly lie 
1090 1095 1100 

Thr Pro Asp Glu Leu Pro Gly Leu Leu Ihr Gin Ala Gly Tyr Gin Gin 
105 1110 1115 1120 

Glu Pro Tyr Leu Phe Pro Leu Ser Gly Glu Asn Gin Val Trp Val Ala 
1125 1130 1135 

Arg Lys Gly Tyr Thr Asp Tyr Gly Tlnr Glu Val Gin E*ie Trp Arg Pro 
1140 1145 1150 

Val Ala Gin Arg Asn Ihr Gin Leu Thr Gly Lys Thr Thr Leu Lys Trp 
1155 1160 1165 

Asp Thr His Tyr Cys Val lie Thr Gin Thr Gin Asp Ala Ala Gly Leu 
1170 1175 1180 

Thr Val Ser Ala Asn Tyr Asp Trp Arg Phe Leu Thr Pro Met Gin Leu 
185 1190 1195 1200 

Thr Asp lie Asn Asp Asn Val His lie lie Thr Leu Asp Ala Leu Gly 
1205 1210 1215 

Arg Pro Val Thr Gin Arg Phe Trp Gly lie Glu Asn Gly Val Ala Thr 
1220 1225 1230 

Gly Tyr Ser Ser Pro Glu Ala Lys Pro Phe Thr Pro Pro Val Asp Val 
1235 1240 1245 

Asn Ala Ala lie Ala Leu Thr Gly Pro Leu Pro Val Ala Gin Cys Leu 
1250 1255 1260 

Val Tyr Ala Pro Asp Ser Trp ^Set Pro Leu Phe Gly Gin Glu Thr Phe 
265 1270 1275 1280 

Asn Thr Leu Thr Gin Glu Glu Gin Lys Thr Leu Arg Asp Leu Arg lie 
1285 1290 1295 

lie Thr Glu Asp Trp Arg lie Cys Ala Leu Ala Arg Arg Arg Trp Leu 
1300 1305 1310 

Gin Ser Gin Lys Ala Gly Thr Pro Leu Val Lys Leu Leu Thr Asn Ser 
1315 1320 1325 

lie Gly Leu Pro Pro His Asn Leu hiet Leu Ala Thr Asp Arg Tyr Asp 
1330 1335 1340 

Arg Asp Ser Glu Gin Gin lie Arg Gin Gin Val Ala Ehe Ser Asp Gly 
345 1350 1355 1360 

Phe Gly Arg Leu Leu Gin Ala Ala Val Arg His Glu Ala Gly Glu Ala 
1365 1370 1375 

Trp Gin Arg Asn Gin Asp Gly Ser Leu Val Thr Lys Met Glu Asp Thr 
1380 1385 1390 

Lys Thr Arg Trp Ala lie Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly 
1395 1400 1405 
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Gin Ala He Arg Thr Tyr Gin Pro Tyr Phe Leu Asn Asp Trp Arg Tyr 
1410 1415 1420 

Val Ser Asp Asp Ser Ala Arg Lys Glu Ala lyr Ala Asp Thr His He 
425 1430 1435 1440 

Tyr Asp Pro He Gly Arg Glu He Gin Val He Thr Ala Lys Gly Trp 
1445 1450 1455 

Leu Arg Gin Asn Gin Tyr Phe Pro Trp Phe Thr Val Ser Glu Asp Glu 
1460 1465 1470 

Asn Asp Leu Ser Ala Asp Ala Leu Val 
1475 1480 



<210> 15 
<211> 23 
<212> CNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequoice: oligonucleotide 

<400> 15 

cgggatccga tgattttaaa agg 23 



<210> 16 
<211> 16 
<212> I3SIA 

<213> Artificial Sequence 
<220> 

<223> Etescription of Artificial Sequence : oligonucleotide 
<400> 16 

gcgccattga tttgag 16 



<210> 17 
<211> 19 
<212> ENA 

<213> Artificial Sequence 

<220> 

<223> Etescription of Artificial Sequence : oligonucleotide 
<400> 17 

cattagaggt cgaacgtac 19 



<210> 18 
<211> 26 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> DescriptiOTi of Artificial Sequence : oligonucleotide 

<400> 18 

gagcgagctc ttacttaatg gtgtag 26 



<210> 19 
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<211> 28 
<212> ENA 

<213> Artificial Secguenoe 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 19 

cagcgagctc catgcagaat tcacagac 28 



<210> 20 
<211> 18 
<212> rWA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 20 

ggcaatggca gcgataag 18 



<210> 21 
<211> 18 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 21 

cattaacgca ggaagagc 18 



<210> 22 
<211> 26 
<212> ENA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: oligonucleotide 

<400> 22 

gacctcgagt tacacgagcg cgtcag 26 
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<120> ^3c»wJ. TTDxina Ani3 Utes abareof 
<i30> PI/5-:*(W^i/A/03C 1962 

22 

<170> Fatairtlji ^^^^ 2,0 

<210> 1 
<2L1> 
<212> II'J^ 

<213> P5rjotoQchafcdu& Lvminesoer^ 
<;221> CDS 

<222> {4L2^ .<L6fe| 

<22&> 
<22L^ OX: 

<223> 0(rf2 -2a.lkI]Bt 

<220> 
-<i2i:> 

<223> corfi -20.7 toa 

<22»:3> 
<22I> 'CTO 

':222> (3342).. (411fl) 
<223> ori^ -2fi.V kDa 

<221> CDS 

<223> orf5 -17e kCftJL 
■c40D> 1 

eSpftttcatAt. gtrtatgaaat aaac&grttgg cgDaata&tt; a*!S^tatta tttctanttt 60 

gtt'cttat-a:: aatgatfltgc: t.r.wtt^aac ag^tAatga gtbtotcjctta aataaatcsct 120 

C'^erattUt catjgatstta tc^cg^tg t^c^tgia&c ai«btat.tiu:a taattagatt IfiO 

«3it:d.^i:^t3a r^tAttaas At^csi^>r^U etL-t^^UySKt ro^tatttat Gatcggtttt 240 

atto^witggy gsSiauSiftijXil-rt te^asitba^tt ttacacaatt cactmatagc cjattocatta 300 

acca^BAta Urjaft;S^;Sieit«*C: ttAtttat;^^* t-ta-tsgtont ofcatcttcat Lagcx:tactt i6D 

rtatanctoQ c«aaicibt0cuc abtttcaabc ceitgtatAaa t^i^t^ccas t atg ill 

Met GLn 

get oaa cqa grtt i?tt ate aoa ggt ac^ ggt gcc gta «vc« att 4W 
ATO Ala Gin arg Vial Vbl lie Gly Gdy Ha V3J "ttir Pro Xle 
5 1& 35 
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ggr. gaa g^at gtt gaa tea t;gt csa as^t att att gaa. as^ caa car. blS 
<oly GQu iisp Wi Slw Scr ci-SJ Trp Gin Set^ J it^ Ho du Lye (iln His 
20 25 30 

oca ctt CSC uj2^ att ana ttt cct ccc tosi ttc att aat L<ig cgt ttc Sfil 
Acg 1+*: His -ftrg lie Gilu Ete Pro Asp Sfer Phs lie Avi ."Scr Acq Ste 
35 40 45 &0 

ttt tct tti:? ctt aw.^ ccs^i 1>^;cr t/%t cacsg tta eta cca bbb 60D 

Phe sser Efie Leu ALa Pro Asn Pro Ser Arg Tyx Gin Uai r^Xi Pro las 
55 60 6& 

a=ra tta ^ct cat eca ctt tct sac bgc gga «m jgca Qca ttg 3Bg gog 6S7 
Ji^a Ijftij Tbr llie Itur Leu Ser Aep Cya CSly l^R Ata Al<j Lbu Ivb Ala 
70 75 

act tfiafc <3»R get Ctt erac om qg& ttc ggc gtg £iat ata tea cert gtt vOb 
Thr ayr GlJi Ala Ttar Gin Ala J?l:e CSly v&l Afift rjts £4air Pro Val 
B5 90 95 

g?s.K t4At li^f: 9*a»"- gtA att ctt ggc aot ^igt tgg 753 

Glu TiT lyr Asp ij^ Tyr dlu Cys <Slv v&l lie Leu Gly J3ftr Gly Tip 
IQO .110 

££cie5 ^cli «tt fi^t aat gtx! ^t f:3t ^t tec oaa tat £^ raa gca BDl 
<ily Ala rlB Asp ASH Ala Gdy Aap Eiis Ala Cys <Sbi TVr Glji Ala 
lili ISD 125 130 

aaa tta get cat oct atg agt aat ctt att svor: e^ty con 3*gc toe atig 
Lys Ala Pro Mteb £teir Asn Icu IIb lyu- Met Pro Ser Sar Itet 
135 140 L45 

ocir i^ct gra tgt tC5 att Dt-si tat gna cts. ogt ggt tat ciaa atfit ?kx? 
Thr Ala Ala Cys Sea: lla mt. Tyr Gly l^au Att| Gly TVr CIji Aan Thr 
IW 155 IBD 

iaUt ect cftcsc tgu gca acg gsc ciuc* atg gcc ata ggc cist tU. "9*5 
val >tet Ala Ala CVs Ala Tlir dy Tto Met Ala lift Oly A^ Ala Ftie 
Ifili 170 175 

gsaa att att age tea ggg egg gca &aa tgt at^i att qcc g^a gcc get 993: 
<a.xi lie Ilo ftr^ Soot Cly Arg Aid Lys Cys ttet lie Ala fii.y Al« Ali\ 
ISO 165 

E?3« tOG etc t^sg o^^j tgt aat att tgg aqt atl: ^^.t gta ctg aat 1041 
Glu Pjsl- li^i Tihr pj.-q Glu C;^"-5 Asn lie Tip ^ lie Aep V&l Ls?u As*^ 
195 200 210 

gca tta to? aaa gaa caa gcg gac cca aat <:tU 5?^ t(3t tgt cca ttt 10B9 
Ala Leu Sot Li's du Gin AXa Asp yro Am Leu Ala Cy^ cVs Pro Kiia 
^1^ 220 

gal. cierc: tt?t gga ttt gt« tto gcc gaa gpg gcxi gicj^ gt.l". 11 
^3er LBQ A^ Ams Ser Gly P)^ ^ Ala Glu fijy ALa Ala vsal vbI 
'^W 235 24^0 

tgt ctg aat tjal. gt?t tL;tt guu cgt ggt cjca aog att tta llfiG 

CvB Leu Glu Asm "lyr AEp iSer Ala lie Ala Aig Gly A]^ Thr He Ifm 

250 255 

tttt. 0^ szct tac gcc coba t«t tra gat gcc gtt aAl. Cta fivCC 12S^ 
Ala Glu rla Lye Gly av>r GLa Tyr Sear AiJp Ala VSal Asn Leu Ttir 
2W 265 270 
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egg CCS aca gaa qat att cct ooft ato tta 9^ ata act saa 90^ 1281 
Arg Pro Tbr GlIij te^ Xl«; cilg Pro He Lefa Ala He TOr Ly& Ala 

27^ 2 BO 2B5 290 

att ^gj caj0 gca cag att tog C05 «izk* gc^b atfc gswu tac att aat get 132^ 
Ilc: Gin TlI& t^hi iLe Sear Pro lys Asp He Asp rje Asci Ala 
295 30D 305 

rat- 93t s<2lt tet aca ocg tta ear fi^t ctt tar. «Dt c??^ gca att 1377 
Kis Gly Thr daz- Thr Ftro Lou J^axi Ab^ Leu TVr Gdu Ihr GOirt Ala lie 
31D 215 320 

^ce gca ctg ggc cbb. tat gcc Lat cag gta cct ata tea ajjc aca 1425 
1^ Ala Ala Leu Gly Gin 'Tyr Ala 'IVr Gin V&l Pro He Ser Sac Tbr 
'y2^ 'S^D 335 

^ tct tet acc ^ cBc ctt att get gcx: ^[cxn $gt. ?»gt ttt gaa aog 1473 
'H/B Ser Oin: Thr CQy RLs lieu lie Ala Ala Ala Gly Ser GLv Thr 
340 345 350 

dtt gta gtg nna ©sei tto get scei rat tgc ttg cca gca tbg 1521 
He ^tel VyB val lye Ala Lea Ala GLu Aan Cye Ptr? Ala llir Leu 
355 360 365 370 

aji3.t tte CJaC" C3^* ©cti gtlt CMXi ©at tgc etc aat tat tt^ C<H; mt 1S69 
Aa-j i£u His Arig Ala Aap Rro eye Aep Leo Asn Tyr ijau Pro Aan 
375 360 355 

cs>a cAt t^c: tcM^ occ get caa oca otc otc oca etc aat att /^igT? gca 16L7 
Glri His CyB Tyr Thr Ala GQn £to Glu val Thr L$-u Asn Ha &ar Ala 
:i90 335 iOD 

gigt ttc ggc ggg cat aac get gcg ctg gtt at-::: azw sjta taa 1665 
Oly Ptj-: Gdy G3y Has Asn Ala Ala Leu Vhl Ha Ala Lye V&l Aj?g 
405 dlO 415 

ctgatatgtt gatttttgca atg gaa gat act gaft csfct tgg tcsa aat ttc tct 171^i 

>)Bt Glu ABp He Glu Kia Trp Sev A-ici Eto Ser 
420 425 

ggg oat ttt aa^ coc ate cat t^t teg gcg aaa agr: gng t:ct ttg age 17£b 
dy ABp She Asm Piro Ue His Tyx Sex Ala lys I5er £er 1^ Aiij 
430 435 4d0 ^145 

dfcfl. oftg oflwi 031C cc^ qtg cag gga atg ttg agt tte etc tat qta 1814 
Asn He G2n Cbi [lie Pro WJ. G1t» G^v W*t Luu 3er Usu Usu 3yr 
450 455 460 

c«y* c^Q bbt tct coa eta act tec get ttt aca a«g gga ata tts? 1862 
Arg Gin Gin Itie Ser fiLo i^au Tl^ Sisir AJ^ Thr 'ihr (Sly He 
465 470 475 

arc Att gat qcc tct ttc cgc cag tat gtt tat acc g»o ttia ccc cat 1910 
AsD-i He Aep Ala ear Pha Ans Gin TVr Val 7yr rrhr Ala 1^ Fix* Kis 
flaO «35 4S0 

caa ctg a^ att aat aict: ^ a^. ^ tl.t ?V5W ttD aat ccc 195ft 
Gin Ltaxi At^ Jlr A-ai IBur Lys Asn l^ys Ite Phe lys Lera Glu Jtoi Pon 
495 S£lD 505 

agt aaa gaa aac aog ttg ttc ggc aat ace agcr gta oag aat aca atg 2006 
Scr Glu ASn Thr Lau Phe Qly Aan Ita Sar >al Clv Agn Tta Mat 
£10 S15 520 S25 

gag tea att gaa f^t to? ate gtt cag gat aat l;gt caa aaa eta aog 2054 
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Glu ©ar 2le Glu AEp Tirp lie v^l Gin Jis;g, CJy^ OLri L£u Olir 
530 535 540 

5La am ogu ^>aa gtt tgt tat gtc r.r_L: aga tzic tat 2102 

iLe itw cay Glu Glu eye Glu lv« ayr Aka Vol Pte Ara TVr Tyx- 
545 £50 555 

ttc ooa ajTt gtc act tct ott ggn tog ttc ctg gBt gog cstt get ttt 2150 
^i£r Pro ser val Thi: Ear rle Gly Tacp i*ifi Leoi ft*^ 

560 565 S7D 

cat ctt att ^tt ^Rt tcs aca tbt cct aat ttt gag Cftc tac mt 2198 
Hie Leu lie He Aen Ser Thr Gly T*ifi Laj Aoti Phe Glu Hlb TVr His 
575 580 £fi5 

bt.t A» ran ttft i?wg g?i.t fcnt cte ogt czin tct ttt act tby cat 5ict 2246 
ftvs Asn Gin Gin Psg> lyr Leu Ser Gli^ S^x Hib Thr Leu His ita 

GDO 605 

ggs= caa gcg att aaa ate agg aag gag att gtt ?3Hit agt aca gLa ttn 2234 
cay OliL Ala Xle Lys 11^ Arg la^ <3lIu He val Am S^: Thr Val Leu 
610 6S0 

tta tct ti3a cog ^t ate tgt gtt gaa tta ^at oct oct tta ttg abl. 2342 
6cr ^ PJ?^^ Ho CVs \fctL lisu j>hro E>ro Pxt> Lou Leu lie 
625 63Ci 65B 

oag aat ggc j^t aaa get tst att cgt att ttc bat tat cga tgt tto 2390 
Lys iftfin Gly TVr Il9 Aro He 5te Tyr Tyr Am tvs Leu 

640 645 650 

tat gat aaa aaa cc?t att tbt eta toa aag act tea att etc tct E»g 2436 
Tix Aee> liJs la^ Pro lie I4ie 'W&l Ser Lys Thcr Star lie He Ser Ivs 
6SS ggo fifiJi 

atCT aaa taa aai^^aagcg aaatgoGeac ACiS«wgtis»t attttcactg 24B7 

h^t L^B 

670 

oaataaooaa bsifiPKatatta at^tgaagg atatA^aiftsa toseqaaata acactaigwgt. 2547 

wtcttttgt ttcBctt^saa tttgatagtc ttgsactatst ggjaaaatocaa gtttttxjtgt 2G07 

l5Sertta$iSsrtft bagtattsjtg cttaaagccg aactttcttc jBWktcattct atttcaiiusiib 2667 

bftftfttowgct cactOTctat ttaaaat<M aattgtaAtc troattttta cttaftl:j;wibg 2727 

bbfctttxsitac atteaoatta agaggttata atg aac ett tta fgaa caa ggt- ftaSug 2791 

>3Bt Am y^l Leu Glu Gin C5ly Uys 

675 eeo 



qtt oct get tta tat tcA ^co Lat tc& gaa aosi r^a ggt tttb bcc t^ro 2625 
iOa Al& Le^ TftjT Ser Ala Tyr Ser Glu Thr Clu Glv Ser Ser Tm 
6fi5 690 695 

gta ooa aara ttig tgc tgt V-tt Uca t>gt cjot gag cat ti$ cot act 2677 
Vai Gly Aan CVs Cys Ite Ser Sea.- ABp Atig OI.tj t^jug Leo Pro He 
700 705 no 

ate ^ nat ^ i?gt cgt ttc ttg att gaa ttt gtt att cca gat cat 2925 
Zle Viil Asn Oly Arg Arg Ite Lflii lie Glxj Hjc5 itel lie Pro Asp Hisi 
"^l^ 720 -PZS 

tt^ Ct-t g^K a^sa «cg ^t aaa ccc aga gta ttc gijib tt^ gat etc aat 
LBU Leu ASIS Thr V&l I4« Pll> AiVJ Ufsl Phg Asp LBU JV> Asri 
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730 740 

zujo, oaa ttt ct^i ct^3 est cct C5>c Otit oCTt Q^g sta aat at.t t3U otb 3021 
1^ GOn Fte L£u Atxr Arg A«p Hift^ GIu lie lie Tyr JJeu 
745 75P 755 "7^0 

1J.A ijc^ afti: ti-l- 5it^3 9Gt csgg «co s^cg aca baa ^t^l rsto 106S 

Ijbj Giy Glu Gly Asn Ite Met Asp Arg Tl^ TJ^r' 7>ir Asp liys Asn Jjeu 
16S 770 T??^ 

ttc gag tta aat gag g^gt tea i:L3 I'Jt-t att owisi acq tta £?gjc car. 3117 
Wie oaus /fc=iai Gl.^ ftsp Gly £fer the lie lye Thr TiCTi Arg His 
7 BO "Jas 790 

Oct ctt ogc s»a tac get get act aat <x^. tos act acg oaa ttt ate 
jOa Uaj Gly lyB T^x- \MiX Ala Ilo ten t>ro oer Olir Thr dJji Ftaa lie 
795 600 B05 

ttc ttt gca oaa Don eizm tac agt gaa ttt ata atg oat ooc tta aag 
fte Phe Ala Gin Gly Lye Tyc ai« Ito lie Hat Aen Ala J«u I^ys 
{?L0 615 H20 

Eca ggtt ^aa cpc gaa IK^. tea oaa cct tat gtc aget ^*bt att cct 32tl 
Thr Vfel Glu ^Bp Glu Loi Ser Lys Arg Tyir Ars Vhl knj He lie Pro 
830 835 840 

gaa ttug «aa <5Ce tat tat gr^^ ttt gaa ctt gat att ctt tct att 
Glu Leu Gto Gly Rfo lyr oyr Gly Glu Leu Asp lie l£u Ser He 
(10^ a&O B5& 

aca get taa ttcacaatat tAtesosogt get atg gaa &ag eww Ata aca aca 3362 
Oltr ALp Met Glu Iva Lya lie Tlu" OSiat^ 

660 6e5 

tJtt. Mo att 935 oaa act gat gac aat tlit t^t rjct aat ggg C3gt 3410 
She ihr lie du liya Thr Asp Itsp Aan ite ayx Ala Asai CQy ftr^r His 
87C» BS^ 

c^. tgt e^tg gta oaw* ate tct gta ctt. ooa QStst tat agg aai, c^t 3458 
GOn Cys Mat vaL lays He Sex Vcti Leu Ivb GLtx Gbi Tyr Aju Am Gly 

890 a$5 

gat tgg ata aaa Cta goa ctt agt gpt 9^. n^oa noa teg att cac; 
A59 Tip Il9 1^ liau Ala lieu Sex- Glu ALa Glu Lq^ Arg T.lc Gin 
900 905 

gtc Gcji E9ca tta agL gat a^; lu+c wta tat »c cas f.tA waa «ta cct 3550 
val Ala Ala 1^) Asp ser Leu Xle lyr A$p CQn beu li^fB btet Pro 
913 920 525 930 

ta% qijt tgg aca aog aca c^L s;>a aga tvnt aaa ttt gat ctt ggg tta 3602 
iSfer Gi.ly Trp Ttir Itac Ita Asp Ala Ai^ A^sii lya Aep li&u Cly Leu 
93 & ^0 ddfi 

tta aat ggt gtt tat cat set s^t get ttt Al.l; ijctc ^ma cag gta 3650 
Ua? Asn Gly Vhl 'JVt Kia TJa At^ Ala Phe iLt Asp Glu Gin Val "itn- 
956 960 

fiat c^jt «50C3 gat t^; try; ara i3bat QBta aac cas oac agt gtg 369& 
7isp Arg A]^ CJly Asp Cys 1*hr Aan GJ.g IVr Gin Aa^ Ser Val 
965 970 915 

aaa agt gtt crrt geA att ate tat cgt tat gtn aet eigt aat ac^ aca 37dei 
bys aer val Pro Glu lie il^. Tyr Ar'g Tyr Val Ger fSta- Auji Arg "nir 
gao SB6 



5DOCID: <WO__©942589A2TL> 



W0 99/i2aaft pcTyEP9g/ntoi5 



aca gaa tzis:? cts. «tg i^aa «iti3 «M;?n ttc qaa a^sg odg 37S4 
Ster ^lir GJ.Hi l£u Met Ala l^fls Mer_ Piufi gIu Aep 'l^- Gly 
1000 lOOB iftLD 

aas cgc aca ttrs aca aca arit ate tea ^Ttt ggt gat. gtt ttt gac 3S4i2 
Aru Thi- 3«v^ T>ir Ttir J\f;h Wei. J^r v^|. Cly Asp Glu Val Pbe J^:a? 
1015 1020 1025 

arar aan atb tte tta aaa gcc att get cct tat gca stt Si^t. jica aat 38M 
ser wsl Leu Leu Lys Ala He Ale Fed Tyr Alu He Am "Tins: Asm 
1030 1035 104n 

raa tts oat S5!btt «E«? At;iC! aat- ai5w ttg ttt gat aaa aca g^a g«g ooa 3936 
GLa l£u Ills Glu Aflw lie Aen Olir Leu Pte i^^s ihr GLu Glu Em 
1045 lOSO 1055 

Bxxt aaa tec ©at act cat cat ot^oi atn att aat <rtt I'^t ogc tacr aca 3SB6 
Thr Li« J3er Asp Hn: lils icLs GIjl lift Asn TVr Ara Tlkp Thi- 
1060 10^5 

l^l;g wca Uat cat tlxr al:L ggg nut qac agt ad: ett 5»t 4034 

Leu Pro lyr Mis Leru Arg lie Lea Glu Gly tea At^ ^lir V&l Aan 
iQVb loeo iflptj 

aga ata tat gtc ctt ggt aaa cca tea a^jU sat afja etc cCg aoa 4082 
Ar$ ia»^ Ve.L X^j 0}y Clii ^ro Ser Asn Ajsp jig Phc Lai "llir 
1095 llDO 1105 

aqa ©ga aga gta ttt aaa cga gga act cat atg tga atocacatga 4128 
Aj:g eiy Arg val t^e lyi«> Axy Ttir His bteC 
IIW 1115 

taal:gtgagst ggaiggatgt^ tliateEymCit^ t^«;at«cc gtaactattic! oyyaLa cpca 4138 

S3cttgctj3ct gaagtgcttiT atgtfisacagg g^gctrtsbgrj ccoagtggtt fll.tabgRtgg 424S 

atatcati^C gtoacaatca ttgataacrta irjggT;t:etOTa cataaactta i^etAtttcttc 430B 

^caatatt g^gaogtspge taagrisstM^^ ^^gtotvaagt tgattttcci: tjagt^wusoaa 136B 

ccttti^tta tj^cjggtaaa cagcatgtsog ttl-^oc^ca attaatatat tissottattc 442B 

aaataogjaat ataaccatat ctgtaattat acatASrtcgia atttttactc gv^batoatt ^4BB 

ttmtt^itc EMacosj^^aa tctaaa atg aaa 1^ r^t ata tat t<:c aat 4541 

fcfet IvE Ala TJir tep lie TVr £ter Aan 
1120 1125 

Qct ttt oat tec s^gt tct tat att aat act ggt. gtc <Tat occ aga ft<% 45B9 
AjA rt^ Aerj Gly Stair 'Xyr 11© Asn ite Gly Val ^ Rro Arg Thr 
1130 3i:>5 lldO 

ggt aaa tat agt gca ajssU all, «Lt ?ttc ziro tts aga act. wist o^t 4fi37 
03y Gtii TVr 5(?r Alo Asm lie Aan lie lie Tte^ U?u Am t^o Aaa Asn 
11^5 1150 115S 

gtg cgt aat teg g^a^ ^:aa aoa tea cioc eta tea ttc trxj oca tto aca 46B5 
vnl Gly Asn Ser Glu GLn T*hr Leu Ser Leu gsr- Ftw sser Pro Leii rrtu 
1165 1170 1175 

acg tta aac aat cgc ttt gtjt. ett gric tog Qjc ttt tcA tUt ao2* Dca 4733 
Thr Leu Asn Asn Gly She Gly He dy Tarp Ar.g Pha Ser Leu -rtir Tlw^ 
11 BO 11B5 1150 
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tta gnt «ba aca ctt aca ttc age ca^a ^oa aat ggg g^g caaa ttt 
Lera Aep ll^ Ijys Thr Leu itir I^h^ Sor Azy /UjH Gly Glu GLn Ite 
1195 120Q 1^05 



4781 



cM^n tjgt oag ly:^^ »;Ug cicg crt aat gat ctt agt tr.t *flft gal- 

Lye cyB [jys Pro ubu ^Pro yco Asn Asa Asci Abp Leij iier ^ Uys Aep 
l^OX) 1215 1220 



aFirf aaa eta aaa gat ttig oac gtis tart ctjc qst a^c aat act ttt 
hys I^ys lijo Leu iiig VteL TVaf" 1^ Thr Jtac 

1230 1235 



txit gtt Lat osc Cbaa 93^:^ abt ^lA ^ba Ott ma C3(p att 
Tyr Tyr n^t lye Am Gly lie lie GLu lle Leu Lyi^ Axg He <3ly 
124S 1250 1255 



4325 



teg agt gat stt gca aaa aca g&t gca, ctt ^sm ttt cct gat ggt ^a 
S!5r Sex Asp life ^fe Thr litel Ala Leu Glu £te Pw> Assp Gly Glu 
L260 L265 1270 



Qca ttt gat tta att tat sussb tcd asT!^ tt;t gra tt^ Coc Qaa ata Ma 
Ala A^p L£u lie Tyx Asn Ser Arg Ite Ala Ijsta £er Glu ll<i i^r^^ 
1273 12SD 1235 



SD21 



cgt gtg bob ggt aaa act tst ctt aaa etc aet t*iC tct ggo ^lat 
a"^-!- An^ ^1 Thr Gly L^•B Thr TjpX Lau Lys Lea Aan lyr Sax Gly Asji 

13DD 



50&9 



aac tgt aca tea gtg gaa tjac cct gat gat. ast 3at att bet gog- awin 
Aan O/B Hit Ser Val Glu Asp Asd Am Am lie Ser Ala Lye 

1305 lai-O 1315 



5117 



ata 9ca ttc gat cat ogrt aac gat tac ctt att aog gtg act gta cct 
li^! Ala Phft ?4fip Tyr Atg Asai lyar ijai He Tte* Tbr Viil Pro 
1320 1325 1330 133S 



5165 



t-w? s^Xr tXit egt c?ct abt ©at t<?t grc cea ttt aaa at<3 acc tat 
Tyx Asp ALa Ser Gly Rno lie AEp ser JlLa Arg Lye itel: oiv TVr 
i:j40 1345 1350 



5213 



cag ac3 tea £;aa ggc gt& Ctt cca gtt atcr tbc 03c caca era 

GLri Thr Lous lyB dy Vol ite tro VSil lie Ser Tfer t*ie Thr Pw^ 
1355 1560 1365 



5261 



ax 5S0t tat <3tt gag ctg gtg agt tat aaa gag aat ggig cat <»aR gtg 
•sla: Gly oyr Vbl Ijeu Vfevl ger TVr litfS QClu Asn Gly tos lys val 

1370 1375 laao 



5309 



ac^ goD aog gaa tat att oct t£t ^^g pet gc^ etc act att 
Thr A^ 'TTix- Glv Tyr IlD rro Tyr ALa Ala AXq l^a ito lie Gin Pro 
133S 13.90 13&i 



5357 



aat gga caa cct grrg crtx^ «ia Wr; ti^t ^ fcat act tea gta 
Gly Asn Gly GlJi 1^ Ala val ser lys Ser Tyr Glu Tyr Ser Se-f vai 
UDQ ld(;?^ 1410 1^15 



5405 



cat aac ttc ttg ggc; oxt tut Ust ggrj! ^^yg cc^jl c»gc ttt oat tec agt 
His Asa Ito Leu GQy 'jyr Ser Ser GQy Arg Thr Ser J*Tfi A£p S£ir 53^- 
1120 L425 14i0 



54S3 



caa 9at aat ttg tat ttg gtc aoa ggg aaa tac act l;iit; tCtt tec att 
SlD Ai^P A^ Leo Tyr Lea Val Ttir Oly Tyr Thr oyr Ser Ser lie 
ld3S IddD 144^ 



S501 



gsa^ ogg gtt tts gat ggt Gaa agt gtg gtt ti^irt gta ata gnft cro 55-19 
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Gllvi Pixg VsL Leu Asp Gly (Sin Ster VW Veil Scr Vol lie Glu Ara Vhl 

IdSS 1460 



ttt aat aaa tte rat tta ^tg emc Aaa gaa AEia OiiA AAt 5597 
Pha Asn Xv« IHb Leu Met ^.to u^© Glu Ala lye Thr Gin Aep Aai 

1470 1475 

aag araa 5itt ocp SiCa gaa att: act t£>2 Aat ^.t ct^i bca «uia ei^t 5545 
Ty^j Arg He Ohr Thr Glu lie itir 'iVr Asn Gilu Asp Leu ser Ia^ Sto 
1430 1485 ld90 * 1^95 

tte tjC£i gag caa cca ^a aat tta cbb caa cct tct cgc gtg tta a^r 
ftie iSer Glii Oln Ero GlIo fen X-ey Gin Oljl Pixi Sdr Arg \ftsl lisu 'l^ir 
15M 1505 iSlO 

c^U tat acg gat ata caa aca aat act tea oga y^=^ ^ag act gtcr aet ??74l 
Arq QVr Ttnr A^ IL^ OLn Tl^ Atf.a'i Tlit S?n- Arg Glu GLu Wir V^l Am 
1515 1520 

att aaa agt oat oat tgg ESOSi awit act eta ctt att act gag ace agt 57&g 
lie Lyc* 9£r Afip Gly Am Tte^ l£u ijjy Thr Gflu 'ihr Ser 

1530 1535 15^0 



ggff at« cae aaa gaa tflfi gtt tat tat ccq ^ «at gisc gaa ggt aat 5837 
Gly riB <SLn liPs du TVr \fta lyr TVr too val dv Glv Ai^i 
1545 1550 1555 

U^TC: Q2t QDC set coc ttg gigft ttt tct ttc tta tea gtt 5665 
ser cys Pro Ala A^ Pro liau CSly Ite Ser Arg Phe L«u Lys Sftr Viil 
1560 1565 1570 1575 

^^39 <a&a aaa gga teg oct ©at ©ct get caa s*gt gtc oca Eat aaa gta 5933 
Tte: «ln Oly Ser Fsro Asp AUi Ala set V&l Ala Am^ Lye V&l 
15S0 1585 L590 

att c»t tat cwa tat cna c»4a ttt cct act ttt acc gcrc get tat ^.|- sggi 
rte [^lE lyr Thr Tyr Gdn lye T*ie Pro Tlir £t^ Har <A-y Ala OVr Val 
1595 1600 160S 

anc 03ia tet gtc wgt naa gtc tea gag noa ata gac aat aas ata oog 
ILi|^ Glu ov^ val Seqt Lys Set Glu Thr JJ^ A£5ip Asn lao lie Ala 
I&IO 1615 IfiSD 

Bg& acx? ttb tat s?tt aac tcjo ccc aoci aigit tct cat ^t teg (yftr; 
Arg Thr ser Tyr Vfel Aaai Ser Pro Thr S*sr ti'F Sar ElLs Gly 3er 
1625 l»0 1£35 

ttrt Qicia aaa ata aog tea gt^ *^tg &aU j^^c ^-^ig can aoj crtc escc aca 6125 
Leu Ala la^s II© Wu: £fer \tal Met Asn Aan Gin Gin Thr y&l Thr 1^ 
IbdO 1645 L650 1655 

ttt tat ^ tat tea gaa Ute iicc aca aat get ac^ gtg 6173 

E*Te li/B Tyr Olu 'jyr Set Glu Ser Glu tet ihr Thr Asn Ala Ihc Val 
1660 1665 lg70 

acc pc?t ttt gat gcc gca cat atg teg aaa aat gtg aca^ tc;t «itt 6221 
V>r Gly Phe Aep Gly Ala Hi* »it Glu SJdt Lys Asn val Tta: Ser He 
1675 1660 3 535 

tat afcc oat caa ctt cgt aiaa gtt Esat gta aac cac gtg att acc 62^9 
Tyr Thr Hia Arg Gbi Leu Arg v&l A^ Vial A^n Ed.s Vtail II© Tte 
1&9D i595 17^0 

gat cog tct tftt gsst cUt ttg ggt cgc att aoa ^ oaa att att cat 6317 
Aff*> Gin ser Jyr Asfp Leu Leu Gly Arg He Thr Oly GLn He Il^j Assp 
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1705 171 D 1715 

coc age «wg sP^ aga gaa att aaa ogt eat tac gtt "Lat caa tftU 6365 
pyo ely Ihr ALa Arn CSiu LLe Lys Arc Asti 'SVr Vtil 'lyr Gbi Tyr Pro 
1720 3.72S> 1730 1735 

ggr pxjt ^ gzia oat gaU »>gi9f cqg gtg atg ftt^^ gijit gfct tct 6413 
GOy GJLy Olu Aan Asp Fhe Tip Pro val Met lie du v&l Asp Sar 
1740 2.7^3 1750 

caa ooD fftc aofzjk cast. atx ci&t tac gat gga al^g gg« cgt att tgt 0461 
Gin Gly v\3l AD^ Ana Livs 7hr His Tv^ /fsp Qly Met Gly Am? lie CyB 
1755 1760 1765 

tog Afct ^ gaa caa gst ^ ^ sjcr? tog ggc aca tqg ggg att 
Ser He Glu Olu OlJi Aep A^p Aei^ <3i^ PdA Trx? GXv Thr Ssi: Gly Zle 
177D 1775 178n 

VAir. ggc aca tat ega aaa. gtt ctt gcc aga caa tat ^t gt-t ttg G557 
Tyr <3aji CSly "nir Tir Ais Ly& Val W« Arg GCLn lyr abp val Leu 
17^5 1790 179S 

ggg oag ttgs age aaq qaa att tea aat ijat tj^ tta tgg aat tl:^ tct 6605 
<5Qi'- Gin fifer Glu He Sar Asi Asp Tcp Ij^ itp Asn Lea 3ar 

i8c» 1805 laio leiSi 

noc €4at ccb tt^ gtU 09t Crtt c?5L. aav cog Ubg sjtt zjioa augs aaa acc 6653 
AJLfl Aan Pro Leu Vfel Arg Lea ALa T^ir Pro Leu v&l Thr 'ihr Tlir 
lS2a 1625 1830 

tat ana tat ©cat $gt dgg gga oftt Ctt trtO <agc eu^s tjaa tar agt gat 67Q1 
Tyr 'jyr Asp Gly Ttp Gly Am L&j •syr Ser *rhr Glu Tyr Sk?r AiStJ 
1835 XgiO 1B45 

ggt ogg ata Qa^ ctg ate <rat gat cct att acsg agg ^tt ouct G745 
Gay At^ Jit: Glu L«j Olu l.le Ki^ A^ PL^a ij^e Ttir Arc Itir xle Tto: 
1S50 1355 1360 

^ 999r ei^* aas sea l.l:fl aal- ?att cca caa aat aat ttc dVfiry 

Gin Gly V£4l IJVB <^lv Leu Gly Met Leu Aan He Gin GLn Af^i A&d ¥tvB 
1365 1870 1B75 

J3»0 cna oog cp^t teg otc e^a oct gtg tat oct cjat ggt aog atA ta.t &e45 
Glta Gin Red Ala Ser He lijst Ala v&l Tyr Pro ASp Gly Thr He Tyr 
IBBO 1085 1B90 1^95 

ogti cgt <act tat cet tat gat gga ttt gr^t c?5t aca qtg aog ga& tiR?33' 
Ser Tto Ar^i Thr lyr Arg Tyr Aep Gly rte Gly Arg Ttr Vfeil Tto Olu 
1900 1905 1510 

«ca ©at i#c:6 ijgt^ «;'-at set aw c%^. aLl gyca tott gat gtg ttt gat 6941 
TiTir Asp Ala Glu Gly Hia Ale Tto GLtl xle Gly Tyr Val Ttie Aap 
1915 1520 192S 

cgt ata «3tg aaa aaa acg ttg oca gcja aca ata tt^i gaa. tec ©ct ^969 
Ang He V3I lys Thr Leu Pro As5p Gly Thr He Leu du Ser Ala 
1930 1^35 1940 

tat gca age ttt age cat gaa gaa tta att t<:g gca rjig gtc «at 7037 
TVr Al?* Ser phe Ser Has GIta Gl\t hoo III^ Sot Ala Lea Asn v&l Am 
19d5 1950 135S 

ggc aas cag ttg ggg gca tta gtt Cat gat ggt ctt ggg egg $ta aba 7085 
Gly Ita Gin Lou Oly Alo Lcrw Ty"^ Oly Lsu Gly Arg Val He 
>9<>0 af>65 1970 1975 
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**gb g*iU «og gptg ggt ggt age saa aci? qaa tat tta tBt ggg act oaa 7133 
Sex Asp ihr ^totl <3a.y GCly Arn la^ llTr GQu TVr Leo lyr t^Ly Pro Gin 
1^^^ 19SS 

gfjt gcftc aaii fjf^g fttL ca^ tea att act act ln^ c«t aat cwm ccwx «Qt 75S1 
GQy Asp Lys lie Gin ter lie Ihr l>ro Sfer Eiis Aaii Gin Am 
lS9b aOOO 2005 

at£i gat tac etc tac tat ctt ggt a^it gtg atg tec aaa bfct- <^vo Ace 7223 
lto<t TPvT r.t^ Tyjr Tyi^ Lftij Gly S3er v^jj t'fct- Uys Ete Thr ghr 
2010 2015 2020 

ogg oca ^^^ac: r-?ij* ^5M^^ tut l-Sit cat bco aog aca tta 7277 
Gly <rta: Aflp GLr» Gin Am Fhe Tyr His Sea: lyss: TTir Gly Tta" Leu 
2026 :?030 203!> 

ttei ti^t gcg U"-^ 990 gt^ Let <;8?it3 act eiat tac agt tst ttc cca 7225 
l£u Ser Ala Ser Glu C51y Ser GOn Ihr Aan Tyr Sor Tyr Vn& Pro 
204O 7.0^ 2050 20tS5 

tc« d9t tta i2fisg fa^ Cc» Ut tta egg gat aat aaa ccg att 7373 
Ser Glv Vtil l>eu Gbi Acq Glu Ser Ite Ijsu Arg Aai A.<sri Tir^j F^rc Tlr^ 
2D60 ■ 2O70 

tcB teg ggp gag tac ctt tat aog atg toe ggt tbg att c^iii ogt cat 7421 
Sear Ser Gly GlLi IVr Lcti Tyr TJir Mat Ser Uly lea He GOn Arg His 
2075 iiifid 2035 

aaa gat agt ttt ggt cat ast cat gtt tat toQ sSHt gnt cat? gna 74G5» 
Lys Aep Sci- Hk; Oly His Asm His Vol 'Xyr Ser Tyr Jisp Ala (Sin GSly 
2090 2100 

aga tta gtc aaa aoa naa cag gat gca ca^ tac ^(id; ttt t«t ^517 
Ajcg ^i?&l lya «ir Glu r^n Atjp AItt can Tyr Glu TNo: 

21D?« 2110 2115 



gac; £at gtt ^^g ttg iita ^uoot ncg noca aoc aaa gac aog aog tea 7565 
Asp Aen \JtLL GQy Jira ijeu He Thr Tlfar Ha- Thr^ Eys Aei> Tto" Tta" Star 
^^12 D .1125 213 D 2135 

tta tec oas ttfl $1*4 wsa asm ate ss»a tat gat get ttt gat c^ ^sa 7613 
Ifiu ser GLn Leu V&l Tto IJ/E lie Glu Tyr Aiq> Ala Llia Ab!P Arg Glu 
2ld0 2145 2150 

ata ^a ogc teg eta &LL i^t sw? ttc tea ate* csa gtt att acc tta 7661 
He iing Sfet; bju He Ser Asp Phe Sex Tie Gin V&l rift T*ir T-eu 
2155 216d 2165 

aoc tat oog aa^ aat aat caa ate agt cafl cwiU «!:bc aicv; tec ate oat. 7709 
ser ^ Ttir Lys Aai Asn Gin lie iter Gin Arg rle ^ Eer lift A.-^ 
2170 2175 2ieD 

^^g gtg gtt ali^ aat oaa ogt tat caa tat gat aat aat craa <rgc 7757 
Giy v&l val ttet iJis Aan Glu Aib TVt Gdn IVr Asp Asn Asn Gin Ara 
^165 2190 2195 

tti age caa t^; r;^^* tgt ana gaa caa tet cog att gat cat acg 7ftDb 
Leu Ser Gin ivr GlJi cy© Glu Gly Rllw GJn 5wr Pro Ho Asp His "Jte- 
2200 2205 2210 2215 

ggt ogt gta tta aat csg cae iutt cac cat tat oac cm tg?? gga aat 7863 
Giy Art Afed Lau Asn Gin Giln He TVr IHb Jyr A.-5|p Gbi Tir^ Oly J«n 
^220 2225 2230 
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e^U. $JS^ egg ctzc g3Z eat t^t oga gat ggt aag ^ta ac3g gtg 7^1 
lie Anc Loji AfiM Thr rjyr Am Asp Gly I^ys Clu !Dir Val top 
2235 «40 2245 

tat cat ttc agt <hs goc gat ccct ciaa ctt att ocft Att- wdc «au 
7yr iliB ite £?er OUji Al« Aa$> Fro Hit: ^aln hsa lie Arg Zle fhr ser 
2250 2255 ^2^0 

tgen aaa cag sta tta $^L. I'^t g&t gpt a&t cbei oca 799*7 

Asp tos Olii GIjx lie Giu Leu Ser 'Vyr ALa Asn Oily Asn Leu Thr 
22€£ 2270 2275 

oQt qsic ^lajob casf. 9^ ^cg etc att toe ^at ca^ sat a^t <:gc ttg @04& 
Ai^ A.^ cSlu Lj/b Giy CSdn Ihr Ldu lie "JVr Asp Gdn Assri Asn Arg Leu 
2280 22$5 2290 i29£ 

gta can gte JM« s&c fagg ttg sjgc ast ctcr gtg t^ age tac neg tat 
VfejiJ. Qlri '/al Zya Aap Leu Uly Asri L«5u CyS fer lyr Gin Tyr 
23DG 23 D5 23.).p 

QSiC EOft tt5 aaa ctB ace gca cair ott tcjr oog aat gigft acc ^.t. 8.1-93 
AEp Ala bau Aai 1^ Leu Tte W& GOn i^^u iQa Asat Gly ^hr Vial 

2320 2325 

aat cga caq cot tafC tos Sigir aaa gtg acg aat ^itt oaa ttc -ggt 81B9 
AsTj Arg Gbi HLs Tyr Ala Sex Gly Val Ttir a^j He Gin Gily 
2330 2335 2i^0 

SS»t ^3^Ja gcg tgg ttg age agt e^c aag caaa ott oga cstt B237 

Asp du Aia lie Tlir Trp Leu Sbt Ser Asip lys Gin Arg lie Gly Hie 
2345 2SS0 23S5 

o!»n ggt caa trs gtc tac tat caa tat ggu ^^tt gac 3?3P5 

Gin Sea- Als I^fs Asn dty Obi Wl TVif TVr GOn TVr Gly lie Aap 
2360 2365 2370 

cat BBC E#3t ai^CT Ett ate ^oo agt ccg oiac gwa aac ttg atg get Bi33 
His A&ii S^r Tfhr Val He Ala Ser GLn Asa iSlu Aari du leiu fcfet ALct 
23Ba 23B5 23&D 

tta tec tat aca cct tat ggu ttt eioir etgt tta att toe tea tta fjog H3ftl 
Jjsa Sar lyc Pjxi 7Vi; Gay Pte Arg Ser Ufu Tie Sar See Leu Pro 
2395 240O 2i05 

ggt ttg aat age gra cao att gat cca gta ^kjca goc tgg tac ttc tea 
Gly Lfiu Am Gly Ala caaii Val As^^ Pro val Thr Gly Tip T/r !cte Lol* 
2410 2415 2420 

ggt cwc soA toi; cgt. stt tt>cr aac c»g gtt ate atg agg ttt coc asjc B477 
Gly Aan Gly Tyx Arg VbI She Am Val Leu Met Arg fhe Kis Sfer 
2425 243Q 2435 

coc gat agt t^3g agt cxA ttt ggt ogg gga o^g att aac cct tat aoc B52S 
Pro Afiv ^ Pr\y Fte Gly Arg Gly Gly 11^ Asn yro Tyr Thr 

2440 246S 2450 2455 

tat tec caa ggc gat ccc ata aac cgj; att gat ctg aac gijl cat ctt 3f>7:i 
lyr Cvs Qlii Gly Asp Pre Ilo Asn At-g Tlt^ Asp Lau Asn <ny Hie Leu 
2d60 2465 2d70 

agt goc ^ ^ ata tta ggc att gtg cte gro Qca att ggz atr: att B621 
.9er AJa Gly Gly lie Leu Gly He val lisu dy Ala life Gly He lie 
2d75 24 eo 24B5 

gtc ggg att gta tea ctg gcu ©ga ^59 9;?3 att age gcg gcjt etc e66J> 
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cay He &jr JjSoi Gly Ala qay Ala A2a lie Ser Ala Gily Lcuji 

2^95 250O 

rttt- gr;t gcg ggg gtjc get ttg ©gg s?3g att cs^rt tct ace a^c gc^ cim B717 
lie Ala Ala Oly Qly AJa GJy Al^ lie JLL^ fiac 'njir Sfer Alo liBU 

2510 2515 

gca gtt acL qgo tict gtc att gga tbg ©ct ry-c ^t toca ata !^ att S7ffi 
Ala Viftl Ala Tfcr v&l lie dy Leu Ala Ala Aap Ser IL^^ giy il^t 
2520 2525 2530 2535 

STO tea gc^ t:ta Sj^ gaa aaa gat. ccg aaa ^ina tct sgg at^i tta 0313 
Ala Bar Ala Ala Ijsj ESea: K^si Jjye Asp Kro Jjys 'ihr £3ter Gly lie Leu 

iSiS 2550 

aat tgs abb a^t ciog gga oeg tte ttt ogt otc gca 8B61 
Asm Trp lie Sex Ala Gly Leu Gly val ijeu Ear Phe Gly rlia JSe^^ Ala 
2555 25^0 2565 

aca aoc tfct ^c? l>r:L Urg ctg gt^ t<3g gcja cjg^ ogt cjijt tct cag B909 
rle Tto Ite llir Set aer Leu \tol Lys Ser Ala Arn Ser Gly £er Gin 
-^570 2!57!i 25S0 

gc£L gtc a^ gqg ggt gtt ate ggg tea gtg oct ctt ttt. cjet ^ gg^T 
Ala Val Srr Gly Vtii II^ Oiy Txpu Glu ]P(he Gly Glu 

2535 2530 2595 

gtt get. age ogt toe a^e a^ ogp t^ gat att gag tta tct. tc^ atA 9O05 
\it*l Ala JSor Ai?9 Ser ©sr Ai?g Arg Ttt> ASi;^ lis Ala Leu Ser Ser lie 

seoQ 2605 2610 

teg ttg ggc Ejca aat Qcg gag tct etc tct aog pgg ata gcg gcg gcg 3>US3 
Leu Gly Ala Aan Ala Ala Ser Leu aar ^Shr Gly Jle Mr Ala Ala 
3630 2«5 2630 

gcg gtt gca gac agt aat gcg aat gca get aat att ctg g^a tro ijta 9101 
Ala Vol Ala AsiP Ser Aso Ala Asn Ala Aia Asn He Leu Gly Trp Val 
2635 26d0 a645 

toe ttt OTt ttt irut 53c?a gta tog aca acc tea gga ata aIjC; e»9 ^tK- SlAB 
i5iear Pte Gly Phe Gly AL51 V^l Ster T^r -mc Star Oly lie xle gLu Leu 
2650 2655 26fid 

aoti cet get tat gra gtg aat c»c cag act ti^ gaa ctg ^jcyt bca <*197 
•rtar Arg tbr Ala Tyr Ala ViJ Afit\ Blx OIm Thr T!n> Glu Leu Sea: Ser 
2665 2670 2U7S 

tea oca ggt oct bcc gac gon gta fl^a-si oct ata ogt tgt etc gtt Uii 9245 

Ala <Sly ^Shr Ser Glu Glu V&l lye Ptd xlk^ Are C^s Leu VtLl Ser 
2G30 2685 2690 26$5 

cac cgc t^o oat ca^ aag cag tga atgttaaocc tcctcgggca gttsagtfceM 9239 
His Aig Tcp ASM Gltx f/i^ Oln 
2700 

tceaacgttt cgaaatagta ocgggaacta tttagocaat cgtccftbtga einjcocgtaat 935? 
gtgttgcgac gtogtttgac aatataaa^ ttctgogaar: cigattegtta ^iCTtctraopia 9^1S 
aaaataaizta ttaggogaca tttgogtogc cttttttaag g^Siac?tbbatc OEjgttaocitt 9479 
tat^gaagc tattttgttt togacggstg ttggtttcrtc: f^ftsabaaaa aiat^iaaasiisa 9539 
aaUg&tgtx^a agggtgataa tggttasttg taaa&tfttgt gatattattc njcatttatat 9599 
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^^^tgtaa ttoztcttiat^ tatt^tiziatbt tottgoattt- ^Asagcc^ atogccttat ^£59 
Aar.t.i;rjatt.t ttstasiatt attAt&toat o2itta»»cta aaataaatta tttctHga d717 



^21D> 2 
<^L2> PKT 

<2i^> I'lv^iDrhalDdaB luminescenB 

rk?t <:^]ji At^l Ma Gin Arg V&l val lie Thr Gly MeL <31y Alu Vol Thr 
15 1(> 15 

ETD lie <Aly <Slu Asp Afel Glxj Qi^ 7tp Gin Scr lie lie Glu lys 
20 25 30 

gQi) .^xg ite Hia Ascu He Ftie Fro Mp Ser P}^ lt« Sbt 
35 4D 45 

Xi.rvi pte Htf! Sex Ph^ Leu Ala Pro Asa Pro f^eo: AKS TVr GLn Usm hso 
" 50 55 60 

Fro JJys I^'S Lff^ Hie Tinr Leu fi^r Cys dy Ala Jila Leu 
6& 70 7S BO 

Lya Ala 'Ihr Tyr Gto Ala Thr Gin >la Phe cSly vial Ju?ri He Sor 
B£ 90 9E 

Pyo Vfel cau Tyr nyr Ae^^ Lyg: rryr (Slo CyS. Oly Vtel lie Leu Gly Ser 

loo los aio 

GCb' <ILy Alo 11^ Asr? Aan Ms Gly Asp HiB Ala £^ <Sln lyr 
IIB 120 125 

Glrj Ala U-^ L^ Ala His Pro Met Sex Asn Leu He Thr M?t Pro Sbt 

ISO lis 14a 

£er Mat Thr fila Ala Cye Ser lie Met T^/tr Oly Icu Ani tSly T/^ 
145 150 15?S l^D 

Asn TTir Val t-fet Aia Ala Cys Ala Tlir Oly Thr Met Ala XLe Oly Asp 
165 170 17& 

Ala JbB Glu He xle Ar^ Ser Gly Axg Ala tfet lie Ala Gly 

IfiO 185 IdQ 

Ala Als dii Ser Lau Tlir Arg Glii CVf> Asjn lis Trp ser lie Asp 
195 20O '21^^ 

Ixiu A£3l Al^ Leu l«<s Olu Girt Ala Aep Fro A^ i^ji Cy& Oys 
2X0 21^ 220 

r=xT? HiG Sur Leu Asp Ana Ser Gly Ite \tel Lea Ale Glw Gly Ala Ala 
22S 2211 235 2i0 

val val lieu filu Asa •ry*' A^Sf^ Se^- Alzi ILe Alc^ Arg CSly Ala Tl^^r 
246 250 355 

Iln? flXfl alu T.l/zx U/s Gly 1^ AIa Ciln Tyr Sex Asp Ala VsjI Asn 
2G0 26S 270 

Usu ihr AX? Pro 1^ Glu Aap lie Glu Pro ly^ Jl« I^em Altt He ^Jir 
Sr75 2 BO 2B5 
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A]^ 7l« GIm Cain ALa (abi lie Set Pro jyr lie 

290 295 300 

Aan Ala Kie Gly 'ite Sex Thr Pro Lou ten Asp Leu lyr Glu Ttit' OIei 
305 31 U 315 320 

Ala lie ALm Ala L^Su <3Jy (Sin AJ^ TVr GLn Val Pro lie Set* 
325 330 335 

Set Thr ser Tyx lOir <31v His Ijbu He Ala Ala Ala <3Jy Sdr Rie 
idD 345 350 

Thr He Vial eye V&l lye Als Leu ALa Glu Asn Cys Leu Pra Ala 
365 265 

'i'hr L^. Asn Ltsu His Pnu Al» ASip Pro Asp Cys Abd Leu Am OVr Leu 

:^?0 37S seo 

Pro /len Gbi itls Cys 1^ Tta: Ala Gin Rno Glu v^l Thr Leu Ami lie 
330 395 400 

Scr Al« Gly Phc; Gly OJy Hii; A.':rt Ala Al?^ ijcu Vtil lie Ala Ivf? 

405 410 415 



<210> 3 
•:212> Hftl' 

<213> PbotDrhabduE luntneBcenB 
-:dOD> 3 

kfer. Glu A£4> lie dii His Tj^v S5e:r A^;n T^i^t J&sr Gly Aep Phe Tlsiii Pr^, 
IS 10 15 

lie Kis Tyr eesT Ala 14^ ©ftr <l3.u ^;»-r. Tcti Aru Asn lie Gin RLi Hiy 
2<> 2[> 30 

irro V^l Gin <ny l-fet Lei3 Ser Leu Leu lyr Vial Atig Oln GOn l*te £ter 
3ii W 

Gin Tt^i- Ala Pte Thr Tlur Oly He Leu Asn He A.sp Alji Ser 
60 55 ^ 

EliE Arg Gin Tyr v&l Tyr Wir Ala Le^j Pro Kls Oln Lbu atb lie Am 
6S 70 75 80 

•3hr Aa^ lye Thic Plhe ijaii Glii Ajjn Pr^ Secr lao Aa^ Tht 
65 SO 95 

ffiB Cdy Aen Thr iser val Glu Asn Thr Met Gl.U &Sx lie Glu flep 

100 105 no 

lip riG \Ail Gin Aap Am Cys Gin Leu Thx XUl T)ir dy Glu Glu 
ll-*^ 120 125 

Vftl Cys Glu Tyr Ala Vhl Az^j Tyr 'IVr SS^hB P^o SeL- VbJ TlTr 
130 13S L4D 

Ser IlD G]y Itp a» jjEU Asp Ala Leu Ala Fhe IJlef L^^ij He iig Asn 

X55 160 

SSer Tta: Gly Ptoe Lieu Asn Phe Glu liie oyr Wijj Eto Am GLn Leu GLn 
1« l^O 375 
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A-ip TV'* GlJ*! Ser Ite Ohr Hjsu His Tlir Giy tSln Aia lie Ijya 

IBO 190 

Xle Arq X^'s Glu ILe V«il ten Sex ITir V^J. Ij^u Lktu Sfar Ser 1^ 
195 200 205 

71e c>r'^5 ^Ail Leu A£n Pro Pro Leu Leu lie lya ten dv As^^ lyn 
210 215 220 

tep Ti^x lie life Ei^ TVr Tyr Arg eye Leu Tyr ABi> I.ys Pro 
225 230 235 240 

lie ftie Ifel l^; Hie i^' He He Sea: lyn UM; lys 
245 250 



<21C> A 
<211> 1S6 
<2a2> PFTT 

<213> C^^otcrhnbdus luimnsBcens 

>4=-t AETi V&l l£U GQu Glrj Gly Ala Ala Leu Tyr Ser Ala TVr 

1 10 15 

Sfcsc aUi Thr Olii <31y Sear fS&r Tcp v&l dy Asrk T.e*;^ Cys Cys ihe iiEr 
20 25 3D 

Ser Asp Aoi Glu Htis Leu Pro lie Tie VfeiL Asn dy Arg Arg rbe 
S5 40 45 

rie GLu Pfce lie Enc. Asp His Leu Leu Aep Ly^ Ttr VbQ Lv3 Pro 

50 55 60 

Ars V«a P)>e ?*st> Leu tie ASii I^fle: <3lri Hio L^u Lffu acq fa^ Aep 
6S 70 75 m 

hlis Atb Glv ^KSn 31c Tirr Leu tru <Sly CQu dy Aon Phe Jfet Am 
B5 90 95 

Arg Ttir ihr l1nr isap Lya Asn Leu du Leu han Cplu ASP Gly Ser 
100 105 tlO 

Leu Pite J'x^. li/H T*u^ leu tea hls Ala Le« ivr val aLs iie 

lis 120 

Aai i>ro 3sr Thr Ihr Giln fell© ri© the the ALa Gin Gly Lya TVr *3er 

13b 140 

<Slv^ fl*= rifi ifet Al£ leui Lya TVnr v&l Glu Asrp Gki Lew Ser 
14& 150 155 160 

Jirg Tyr Arg v&l Arg He He Pro Glu Leu Glii Giy Pro Tyr OVr GQy 
165 170 17S 

Olu Asp He Lau Seor lie Thr Alia 

leo les 



<210> 5 

<2ii> 
<2i2> pm 

<2L5> PtotoylwMiiP lumb^BoenB 



50OCI0. <WO._ ©942S89A2TL> 



- 16- 



<400> 5 

Mat cau I.ys Lye He Thr Thr Hie Thr He Gdu ly« 0*^.' Agifii 

Asp A£?ri oyj^ ALa Aan Cily Anr His Gto Cys >!iB(t Vsl lys lie Ser 
IS 20 2^ 30 

Vbl Leu Lys OLn Glu TVr Ars A'Sn aly Afti^i Trp ILo I^ys leu Ala JL^ 
55 40 43 

Gau Ala Giii LvB Arg Seof Il3 GOji Ala Ala Lea Ser £er 
50 S5 &o 

Js^Ki JJ>5 lyi: GCLu Leu I^/b itet Pro Sear Gly Tirp Thr Thr ifhr 
65 .?D 75 

Ala isra Asn Lq^ P4>«^ A-VP Giy Leu A-sn cSly TVr Hia Ala 

85 £K> 

?iflp Ala Shfi He Aap Glu can VSd *lhr Aap An; Ala Giy cy?? qysi 
95 idO l» lao 

Thr Asn Glu Aan Tyr Gin Aen Eer v&l Ear VaL Pro Olu lie He 
115 120 12S 

Tyr Axs Tyj* Uial Sea: Sei.^ Asa Arg Ota: Ser Hir <5lu Tyr Lbu Met Ala 
130 135 14D 

Itfs Met Thr Pt* GQu A5SX> Thr ASQ^ Qly Ly^ Acg Thr Leu 'Ita* Tto Asii 
li5 150 IBS 

MEt Sex val dy Asp Glu f^ie Aop Ser lyB \^ l£u l£u l^s ALn 
160 lfi5 no 

lie Al^ PK> lyr Ala lie Aeo Thr Aen Gin l£u Hijs ciu Asn lie Aen 
n5 130 135 190 

ihr LGUk Fhc Asp lys Tte- Gllxi Olw Pna Tbrr I^ra Ei^r Asp •Itu: Hie 31ie 
195 20O 2g5 

Gin He ile Am Ubu "lyr Ana 'itp 1*iir lieu i»LX> 'lyi His Arg i"ie 
210 21& 22 D 

Lefu CUu OXy A^n Ast> 0sr Ihr Asm Atg lL<i Tyr V«l Leu t^y L&?s 
225 230 235 

Glu Pro Ser Asci Asc^ F^ie Leu 'i^ Arg Oly Anj Val She Ly^; Ara 
24D ids 250 

Gly Thr Eiia Met 

255 



<211> 15E4 
-:212> PRT 

itetcrhabdoB lumineacfifs 

<4D0> 6 

*tet AAA Hir Awp 11<? Tyr S<*.i^ Jteai Ai^* Pbc Asn Gly aer TVr 
1 G 10 15 " 

lie Asn 'ihr dy \fatl Aro Arg Thr Oly Gin 'Vyr £3er Ala Aen He 
20 26 
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Am lie lie Itsr Jjbu Anj Kro aeti Am Vtel GLy ftEoi Sftr Gly COn Ihr 
:S5 40 45 

Leu sej 1^ Eer Ehe Ber Pro Leu Ttu- Thr Leu Afui Gdy Ite Giy 
50 55 60 

Jlc G3y T/p 5he J3ei^ 'rt^ Tt^C LtXi Asp XLe U^fB Oto lijw Thr- 
65 7D 7!i 80 

Ite SET Ptq Ala Asn Gly tslu Gin the lye eye Ij^-t Pro Leu Pro Pro 

«b 90 95 

7^ Afin Asn /iep Leu Ser Ite Uys A£p Ly:> Lbu Asp L£fo At^ 
100 105 XLO 

Vhl lyr Liys Leu ft.?r £5ijr Asn Ttir Ite ^JVr Val Tyr Asrk T^{: fcsn Gly 
115 IM 125 

lie lie Glu lie Leu lyE Arg lie Gly Sfear; &5ir Asp Oe Ala Lys TVor 
130 135 lAO 

Vtel Ala L:;u Glu Pfio Pro Asp Cly Olu Ala Pbe Aep He IVr Asn 
145 150 156 IfiO 

Ser Arg Ete Ala Leu Set GLu He LyE Tyr Aixf 'Jbr tSly i^b "rtir 
L65 170 175 

Tyr Leu lys Leu Aai lyr Ser Gly A-m As^n CSys "Ttcc Ser vaQ Glu Tyi- 
3180 1B5 190 

Pro AfTp Af^ A£» Afm He Sot Alz!i Ly& lie Ala Fihe Asp lyr Arg Asn 
135 20O 205 

Aqp TVr L»u lie Ihr Val Thr val Pro oyr Asp ALa Ssor Gly Pro lie 
210 215 220 

ABp SET Ala Ara fite L^ Met Tliir oyr Gin t^it Leu L?/b Gly Val Cte 
230 235 240 

Ito Vial Lie Ser Thr Pfhe Arg Tin- Pro "nsi.* Oiy IVr v&l Glu Leu vaI 
21S 250 255 

f3er Tyr Lyet Glu Aen Gly His Lys \tel 'ifcr Asp Ohr Glu TVi* lie l»co 
26G 26s 270 

oyr Ala Ala Ala l£u Oi^ G\ri VX^ Gly Asn Gly Gin Rtd A]^ Vol 

275 230 235 

.5?r Lye Scr 'S^r Glw 'Jyr Ser aer Val Hie Aan Ptj#- Gly 'iVr Ser 
290 295 300 

iter Gly Aru Ohr Ber Pbe Asp Eer Sear Gin AfpTf* ten Lew 'I'vr Leu vsl 
ADS 310 315 320 

Wir Gly JA^:? lyr Tbr TVr ter tser lie Glu Arg val. Aiv Gly Gin 
32& 330 -^35 

^ Vif)] S)&c V^l lie Glu An; PtiB Asn Lys Pbe His? Leu Met 
340 335 350 

Thr lya Glu Ala L^e 1^ Ola ^toi? Asn Ar^ lie Thr Olir Glu IIa 
355 360 365 

TJjT lyf Asar^ <5bJ Asi> Jj;;u 3scr Liys Ser lite Sei" Glu Gin Prtj Olu A^n 
370 375 JBO 
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l/AJ dti Cbl PJirt Arg v&l Leu Thr Arg Tyr OStt: As-i^ He Gdn Thr 
330 400 

Asn Ihr iJer Arg Glu Glu Thr Val A&zt lie la^ Ser Aap JSst> Tip Gily 
dliii 410 

ASfi Thi- Ta^u He -nir GLu Ohr Ber Gily lift Olti lyt GLu lyr VfcxL 
420 42& 4iO 

'iVr 'ivr Val Asn C^ly Cllu Gly ten Star Cs^ plo Ala Abd Pro t£u 
43S 4dO 

Giy Efe Ser An; l*e lieu Lvs Ser \teL lOir Gbi Lys Gly Ber Pjrt* Jk8a> 
450 45S 

Ala Ala Gbi ser Ala Aen li/E v&l xL^ Hl-j jyr Tta: TVr Gin liVB 
4^5 470 475 4B(> 

Phft Pm Pfcie T^hr <3ly aJjs lyr Vfet Ij^^ Olu iVr Mai Ser iafb w&] 
4B5 490 495 

Ser Glu Zhr He iian l^s He Ala Ar^ Thr El^ sei^ Vhl Asn 
SOO SOS 510 

Ser Pro Thr iSBT ser Hie dy Ser l£u aIjs Jay:? He Ihr Ser V&L 

520 525 

Met Aeri A«n Gin (Sift Tnr- Wsd The Tbr Rxj lira 'lyr du TVcr Ser Glu 
530 5^0 

tter du Met Thr 'Ita: Asn Ala Thr val Tto Gly P(j>e ASp Glv Ala Kia 
■''^5 550 555 5^1) 

Mat du Ser JUvra Aaa-j via], Ttar 11^ Tyr Thr Kia Ang GCLn Leu Ang 
5*5 570 575 

1^ VftI iL^ Acsn Ris \tesl He Thr Agp Gin Ser lyr Afi^i jj-u 
580 5fi5 

<5ly Af.^ t;»tf Thr Cly GLn He lie Asp Fro dy Thr Ale ArB Olu He 
^>95 top 

I^ffs Arg Aen Tyr >;fel lysr <5]j(\ ftyr Pri> Gly dy Aap GLu Asn Aa> F»T.fe 
^10 €15 £i20 

Tcp Pro VW ttet He GLu V&l Aq^ aer Gin dy Val Axg Arrj i^fs ahr 
^5 6SQ 

His ivr nsp dy Met dy Arg He cys Gtir ii^ caw du GLn Asp Aep 

5S0 

A^ dy -"da TOP fily T^r Sur dy H« Tyx GCLfi dy Thr Tyr Ar.^ f.y^; 
660 6*F> 670 

V&l Leu ALa Arg dn Tyr AW VHL Iatu Gly Gin lieu Ser Ive dw Jl^ 

680 695 

Ser AsR A^ Tcp l£u Trp Aen Lieu Ser Ala Asm P/v? Lou Stal Ana Ijeu 
690 655 700 

Ala Thr Pro lisu V&l rshr Thr^ *rhr Tyi^^ l^r^ Ivr Asp Glv Tm dv 
710 715 ^720 

Asn Lc« Tyr aear^ The <3lwi T^r Ser Gly ftrg iLe du Leu Gly TJtt 
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725 



730 



735 



KiB Jisp Pro Xle thar Arg Ttir iLe ihr 6ln c3Cly Viil Lys Gly Ltu Ci?;y 
74D 745 750 

tikt Uxi ASJtl lie oil) GliJ ten ten Ite Ulu Gdi-i Rrt:> Ala der ale 
7G5 760 765 

Ala vaL Zyr Pro /isp Gly Thir lie TVr Ser ibr JunS^ 'l^^^ T^r Arn 'iVr 
770 775 750 

Asp Gly ElTC Gly Aru TJir vt>I thof GLu Thr /ifi^j Ala Gdu Gly JCLe Ala 
7£!B 7^0 795 eOO 

Itir GLn lie Gly Tyr Jisp val Phe A^ Ttt^ I3i3 Val lye LyE Ti^ Tx?u 
j^Db 810 81S 

RfD Ji^ Gly Thr iLe Leii ^Sl.v Ser ALo TVr Ala Ser P%^e Btxt Kis Glu 
B2Q S2B B^O 

Glu Lau He Ser Ala Leu Am V&l Asn Gly TItt OIji Leu GLy AIm Lcu 
835 «40 fJ45 

V&l Tyr Asj:» Gly Leu Giy Aacg Vfel iLft Ssr Asp Tte Vial Gly GOty Anj 
fi^D B55 JSW 

thr <alu Tyr Leu Tyr Qly Ptt? GLn Gly Asp Pro He Gli-j £er 

tsfis 870 sr75 eeo 

lie Thr Pro Sea: Hie^ AStj lys Gin Aen Met Jisp Tyr Ijeu Tyr Tyt^ 
BaS 690 

(Sly JJiaar- V$X Met ^ l4yE Tl^r Thr Gly TSir Asp Caki Gdr^ flsai tte 
900 90& 910 

Ar^ 7irr His Ser Ufs ohr Gly Ttiir UsiJt Leu Ser ALa gtear <Slu Gly vtl 
JS-IB 920 925 

&r Thr Asn nyr aet Tyr Et» L»ro iSer GLy V^J. Lsw Gto Ai^g GJm 
930 935 S40 

Ser Ehe Lea Arg Aw Asn Lys t»ro lie ser Ser Clly Glu oyr liCu OVr 
345 950 355 96^ 

'ibr Met SSQT C^y He Gdn A113 His Lye Asrp Pha Gly AiSn 
9fi5 570 • 975 

Ids \«fli Tyr Ser OVr Asp Ala CSln Gly A^g Lbu V&l Lya Ttir Clu Gin 
980 59S 990 

A^^ta Ala Gin Tyr Ala 'Stir Glu Tyr Asp Asn Val GJy Axn 1^ He 
595 1000 10t£ 

Tbr Tta" Thr Hhr I^a Asp Tlir Thr set Lea Ser Gin i£t3 Tllr Lys 
1010 1015 1020 

He Qlu Tyr Ala tte Aep Arg dlu II^ L^ Arq 5er L^^ lie Ser 
025 1030 1035 inav 

Asp bte £er He Gdn JJc; Leu ii^er Tyr Thx Aan Asn (^Ln 
1045 105d 1055 

He Ser Gin Arg He T^xr Ser ile Agd GLy V^L V<iL >9et. Aen Glii 
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/iTQ Tyr Glji Tyr Aep Aan Jism gIji /srg i£u aar Qln TVx Olji cvb GLu 

lOBO 10B5 

cay Qlv. Qlji Sec^ Pro lie AiSf£> Hi.?; Olir Qly tag Itel Leu Aan GLn cIji 
1090 1095 1100 

lie lyr Hie lyr Asp Gta T:rp GLy Aan lie L^e Arg Akp iijsn ihr 

1115 nan 

IVr Jungs ABp Gly lyB Glu Tiir tel A-^r- "iVr Eli£ Phe Ser Gin Ala 
11^5 1130 1L35 

Pro Oln: <^ Leu Xlfi Aiig rl.i Thr iSfer; Aj^p tys Gin Gin rla Glii 

1145 

See TVr Asp Alft Asri Oly Asn liBu Hit ficm Asp Glu ty^ Gfly Giln Ihr 
1-155 1160 1165 

ll£ TVr Gto Aen Are Vial Gin lys 
1170 1175 ueo 

Cly ten Le« Vhl Cira 9er IVr GlnTVr Ala Am Lvs 0%^ 

1L9S i2(X) 

Aia ^3ln V&I 1^ Ala Am Gly T«nr Am A*^ Gin kis ayr A1& ^ 
1205 1210 1.215 

Cily Lye val nhr Aan ll£ <3ln l/joJ C5iy A;Sij Glu Ala lie ^nur Tm Leu 
i320 1225 1230 

Ser Sar OlA Ax^ n© Gly Eoifl (^n Eer L?ys Aot 

1235 134D i2d5 

^h!JS ^ ^ Tyr Gly 11^ Jisp Ki43i AsSn Ser 'ihar val il« aIa 
1250 1255 

aBT Gin Am Glu Glu l>w Ifet Al^ lio: T!)^ 

127& 1275 1260 

AXTT aer Lea lie sat- St-r lAW Pro <ily 1^ OLn Vol 

12B5 1295 

Pro Vi«l Tl^ GLy T!cp Tyr fhe Leu cly feai GJy i^yx Arg Vjil 
1300 1305 2310 

AenProv&lLflut*stAii3t4ieHiBiiErProAspi>^ 

1315 1330 1325 

^"^^^t^S l^le Asn ETo Tyr Tf^ ayr qys Gin Gly ftflp Pro lla 

^330 1335 1340 

Asn Arc lie Aap Leii Afin (3ly His I.eJu .Sor ALa Gly Gly Tie Leu Glv 
1350 1355 1360 

11^ Vftil Leu Giy Ala He GOy JJe Ue Wal <Ily He \tel Ser Leij Glv 
1365 1370 1375 ^ 

AJ^ Qly AIr Ala lie Ser Ala Gly Lflu 11^ Ala Ala Gly Gly Rla Leij 
^^fiO 13B5 1390 

Gly Ala lie Ala Ser Thr SSbt Ala Leu Ala \tel Thr Ale Thr to^ lie 
1395 IdOO 1405 

^^^^ ^ ^ Ala ser Ala Ala Leu ^ Glu 

lalD ldi5 1420 
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Lys Asp Pro LyB Thr Ser GLy He beu Aa-j "^cp Sksx Ala <aly Ijau 
425 1430 1435 1^40 

GLy val L€u Ser Gly lie ser Al^ lie Ttir I^ie^ Tlir Ser Socr Leu 
1^45 L450 

USil Lys sear Ma Argt Sea: Gly Sftr Oln Alu %M 5er Ala Gly >teL lie 
146D MSb .1470 

CAv ^^ar Vol Pro Leu CQw Hhe Oly Glv Vkl Ala Ser Jixg S&r J5t?r Arg 
ld7B 14BQ 14B5 

Axg liTP Asp lie Ala Lso uer Ser He Ser L£u Gly Ala Aisn Ala Ala 
14^13 15.00 

fier 1^1 sea? ohr <Sly He a1«a ^kla Ala Alo Val Ala Aep Ser Aiui Alrt 
50& 151D idl3 1S20 

Asn Ala Alw hm Gly 'itp Ser Ptoe Gly Pt^ Oly Ala V&l 

1525 1530 153S 

Sbt ^hr Thr Sex <*Iy He He <iLu i£u Ihr Arigf TOr Me Ivr Ala V&l 
1540 1505 1550 

Asn His Gin Thr 1^ Glu Leu Ser £er Se^.' Alci Qly It^ Ser du du 
1555 1560 

Vfe.L Lys Ero lift Argf cy& leu %i Ser hiB Arg 0^ Asn Gbi lire Gin 
1570 1575 ISBO 



<21Q> 7 
<2n> IB 

<2i2> mA 
<220> 

<223> r^tarip^cai of Artificial SequfiMSftioliocriiicleotijdEf 
•<40O 7 



O10> e 
<211> IS 
<212> TTO 

<213> Artif icli:il Sequencft 
<220> 

<i23> Deecriptlcn Oj; Artificial bK2L»ffic&: DLi^c«V*^lDobi(3e 
^40D> 8 



-:210> ft 
<21D> 20 
'-212> Ct4ft 

^213> Artificial SeopAncf^ 

<223> Description of Artificial EBqLke»c«:oLigonurleot:ide 
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■c21D> 10 

<212> 

<220v 

<22i> Deecripticn of Actzifidal deQiJsrK»;oli@ca3U£djeotid^ 
<400> 10 

ttgtaaogtg gagpcqaact gg 22 



<210> 11 
<2il> ^7^46 
-v212> mat 

<222> {15171). 
<223> or£B 

■c220> 

<222> (237 31336) 
<223> h|stC 

<22D> 

^222> (31393 > . . (3Sfi3S) 

<22?> fjorrfi 

<:4D0c^ 11 

i:ettsi2tg9& cx^tgeag^t tat^cctatc gtoa^taga acgaigiGCBcg ^^?tCoeits;!»tg GO 
ocaagsatgtg gtstatgcas qciACt^txi teUt»sBCOTL tawiacctcat ctatcgt.r^a 3 20 
gttca^gtg ^gsstaacog agtctaggiog ^o^teocgs oacooaaa^ caaaagcaac IW 
aogcaDcaagr aatggocgcr, c:i:gcfif&c**aer Sft^tetbsic tcg^cacaac BB«c«s^cas 240 
atcttLLctt gocBcaggtc &Atg^t^ t^zuwicta tts^zaaaaa ttggftftcc«lL^ 300 
ggcugtacaa octgcsgtoat ai^.c?tc*£:n^a tr^ca^gMs ^iccoctacat clDgcct«,Let :$eo 
sogctacacc g^rra^to^ aaagcattac ttagic^qgf: vijtogctoac tacgaaggtg d20 
gggtapjotct ctcocagoza tttatgtxsc -LgtggogttL cxxiscsicabg ctggaaaacg dSD 
cgogtggtat ggtcagtcag cCcBct□aa^: t^gfgctCt-jStC- gctacaaaat attatcgaac S4U 
^ftc&daatfi?:: ggaxsigcttta 2BJcaogctct tgtataaatca agrz^cagaB ctaatottoa MO 
ctoatctcoG catftGooaaD aaaaccaktccF canai^ct^ga tgoggi&sdfta ebcsstskctDO 660 
jiioa^tcx^ aULXjyyyijcg cziaticaogct ttgDcagcta c^aaaagt^A l^cgsbogoaQ 720 
tft^tCii^l^- gic^k'ga;tf^MW! ogggctatvg ^--ttttgcatQc ctoegttgct ggcclKasgocs 700 
crtgccct^cs agcatcacgt ctggcgggcfis cCgogctrjcjA tcGggopncc aacattttog 340 
atiCilxigfCtga tegcggtage cgftt^igggeg cgattgrxxatt agcnacaggt ^t^tCatgg $00 
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£att£.tocgc cagtgtgatg aacaDogaeg qyyrRamt cagtrs^iic^i g^^gicsct'ial^ li^^o 
gccgtogccg t/caggaatjgg gaaatecagc gLaaccstugcf isgaadc:^^ ft b aiitciewsMya W20 
tx^tgctca acttcaatca cLgig^tsgtac gDcglgaagcf fjgci^l^itts oogiUMigTCa IO@Q 
gixftsaaaar: ocaacagga^ <r:HgBCtcatig r^toaar.bgeiv bctoctgcaa CT^Eis^ttca 114D 
gtascxjBBgc gttgLactajc cggctacgGg gtcg!G^:LEi^ bgubot^ttsxc ttcx]aat,tl::t 
ac^cttroc cqt a oogcgt tgtctgatigg ctgaaatgiQfe ttaiy^isttCT gagacteate IXGO 
!3U(3e«:ciicosc nas^ccttatc aa*i«ccoaca cctsxacaroa aaoccatjgag ggiittar:tsBg 13^0 
c-b^tfl^Spft^ CtlWtsebg Bbttbc!ts9cgc euiatagnaan tgcccatctg aggtgggat^ lliflO 
aeKjsgc^cjsUJi; ggfteate^tsa oMsaooattt ccsttanjoQca acactstgga gpactgcxag 14^0 
«a^b09tt taattt-t«TO crcacssattt cb^accctocli agcaggtggt acaaribcp&^ci XSOO 
irrratb^cg^ bcat^^gtt. acattognna ctogzucce^ct rpifjtgrmrifi atctctctgt. L!36l) 
ca^gcctgtc attagatiaat gac!tBCCC^ ^U^dCsaetcgs ogtsggcaac attnymrya 162D 
rtsaacaaat cagt^cacc ttgocagocc tgttftryyiTC atatcaagnt otacaaj^Ca 16B0 
ttjccgtorba coi^gg^ai^ gaaacpg^t tagctjrpReig ctetanatco ctCR^csgratct 1740 
ctcAtQi^t caatqacagt ggtcaattcc agr.tjggattt taaoaatrot aagttTCtgc 18D0 
cgtttc2i!*3j ant-tKro^t aaogat^t^ gmcactgsv^ jSuAcastttc cc3ocaatgc3g 1B60 
^=>^bertcs5>e«it? wnifgaasDx:^^ Rttqco^tTt ttgagtgata t.t^etcb9;?a tottccccatt, 1920 
£s^ctLtf2*2gcc s-^L^utuToacc tcaattaaat accaaaasca ggctxxzL^s^ cgg jyi^ tea 1960 
acttttCEuTg aaCataLacrc rtctracogbo tgctctcttt sioctjgitctga o^tcagCtBU 20^1^) 
aacagagata tttcctLCtc tcgt;gac|ttKr cs^lXwOubeiD tota^eatat caacoctctc 2100 
ctttttcata atatgcaata Ur/tMC^^b c^iitto&tt catttagrta ttzgntaacta 2160 
gttatattac ctatgacgtsi Attataaatt ttfftteCCgc atcacaataig oc&ttCa«t 
aaatiaataac: gttgtgaSM^b astbgatogt b^nntocjtgt ttttacttag ccgttatbkt 2:360 
caacccaatt t^^gaco^li atOngwSgtt. {ncctgtcrttjg ccttCgttitt gatagatata 234D 
jstsitmxxxtzsi ttiLatatoca c^gcacbcag aimcr&ga^t;^ atgt:)tttc%t.t. taoctaacat 2400 
ttSuBbJtfogaSi toaacat/^aa cacactjcaaa tco^tat)9 i^iaiCMacscctl: agtagcaggt 2460 
tttaatstc t^ccgata.^ bbgtcnbctc tcttbtgnog aacttogcaa aaA^.g^ '^!??,Q 
gaca^ACtCt C^Uj^ftDoca ^cccBaa^c ttatatcttDg aaigcac^jt^ asFbgcoaazia 25HD 
9e9C3^b<!UX t9«t{^^»^ ccatattctg aaacgcgoca al^OCtoattb ocaaa^tgog 2640 
gtcc&bctte <.x?c«gDicwoc acctcatgcL ^ocagoasg gttftiAftbag cwoattCOBC 270O 
aatr^ogcCA ^OftftAtotzic ogccocaooc poaatttctc ccatgtLttix toct yoyt^ b 27W 
tAttbagctg aftCtbbatcc ccaogcooKr eatttacatg acgaaaactc t^bttAtXsat -3820 
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ttooatiioaD EK>c5stccs?oa tctanacitoa ttoortoctcB gccsgflaAaa tatnisitacc? 2830 
ssacicitttocn oewrttbctct gtctabobopac abgttsctaq a^ggt&r.t^ ?^c«?tcbgttc 2940 
aaggacaagc t^idyatiti^ac tctnnaOTat nttanstctc tgnooft#soga o^^gct^ftctR 5000 
gaggctttga agaaHattaa Eaccrtctgtcc aaggaogatc t^^bgnsasKlc tttgasgaat 3060 
attaascctc Lgtocaagga ocratctgctg gaqgcttt^ Ku^tattaa acctxrtgtcc 3120 
MOTaocjatc Ccrctgnnggc ttti^isyiaat attnsacctc tgLcc^w^ggzi cgatctnctg 31S0 
92jggctttQ» ^?5i^Uit;l:i?ti ^iC^bgbCC JwagcKJgzxtc t^tggaggti t-btgobagHat 3240 
atta^iw^crtc tgrtccBfi^gift ogauztigctg. gag^Utt^ DuoaatatCaa acctcl^^tct 3300 
«g]3&cg&tc tacagigaar^ tattgaaaf-t. ctattctotc tggacagcca cacstomjjto 3360 
atgi&aBgcgt tatccaattf. <:x3gogttt(Xt ^gcfttgatoc cBtatcacgjfe bgcttatcsM 3420 
^ogtgcgLB aggctgttca attacaggcii: cnggt^tbg aacaogtr^l tnE?t«Kaitca 3-4B0 
ctaflaaafDsa cLatogat^&a actaeaatat caagcPttcbt tottggaaal: twcattcttct 3540 
Utctcgocta arattatttEac tiatctt^jact gaagaaatta ctaucaatcae tgcaaga&gt 3600 
c?t«ta.t^*&g twwAttetgg t^tattaaa ccttctccaa tagBcwni3ioc g^tatctg 2fifio 
aaaagttatt ftcaatx';teA£J tgatgan ga j tttooogatt tnai^ttoaat aaciaa£7tata 3720 
ettcttocag aagaaga^^t *»gcnabtact gobcttocat cg(ic?gt«Ctoc tagtacacaa 2780 
aagacLatcg aaaatcctga LtfttiSKjrtecfc dbattoaaas ttaataa©tJt tettcsjtcta ISdO 
ttcaaagcta CaaacttaCc accgacggta liLaaetOTM tcctccgcag oftbcsaocwca 3900 
paattos^ata tDaataaapa aatattacaa aaaat^Uttc irtijttaacta n^ljafcAtgciia 3960 
ogttAtOTta bt^acact^ oactj^catita atactatg)::^ aggteuccaat tt<:Bi^aUAl. 4070 
tf^l-jcaftai^ecs ^ac^trtoas tcogtttigat cscjttitatttss attccsocxsa actgaaLggt dOW 
csaogatbttt ccebcaabai? racrc^gmt attgattiipa cx>rtaaj0cag t^ccascftajci *l-30 
tCTnatazizia co^tsicttaa acgtgctttt. aacctcgatg atotcticsct aaaticgacte 4200 
cb«a»€«itte ccxfcatocwt caatactacc gaaat/gat^ ct*3atj^Lat flgag5^U)Jtt 42*D 
lx^ti:5^tr;tcl. ?it-»gg7K«ifl.a attact^fjca gatatcc&rjtf: lugttasctat tgatgaartg ^3^0 
^fgttacrtigt t-gcjaa^i^t aOTtonafgnn acaaccaatt 1;AJU?tgaQat Lactoccgac 
^latedgjjtto cbctoattcxia cB^ctjctat gctgtcacta cjctqi^atiact^ tarscaafiag 4440 
tgi^gtgtct atcagttgtt t^t^t^i:!. aclgataboat ataacaaaac cc^tattJiccoDg 4500 
gaaataogiga atttactgi^ tflFwgtCCw: «atos30tUiD aagar.r.r.ioaK tmocwOTita 45G0 
tt^aaagct^ aagaagatcl. Bc^aww^cx: ^Asocmgiu^t tgcagegcrigu oaaggcksuiat 4620 
ctg^saaaaat tocc^fjaaaa ccBgcnacaa cljxwngaag aoaggaaaaa a^oxaws^ 4*B0 
<»ai3ctH3aata zxagctc^skEicsa qacccaogaa aaagcce^^ As^aaicctaga bgaggtcagg ^"7413 
aaaMtctac caazmoccat aLctccttat atogmg^jcg ccctKjcaatt accatct^gaa 
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cabgcf^ecst attccatacrt cfil:ol;i^!iC& gataatct^ wewrogBoac a^gaaaaatg i66u 
acagc^gaaa aantatggaa ctg^tgcrg^ aaa&fistccog tl:;a uyuutj ca acctjgs3at.tc jS2Q 
caaaaacaag ctgaajoctgt gigtjccagtBL tgcaczagic^grx ticr9caa3»K:ft agcgttjgat.t ^dRO 
taar^t<Tta coggDcttas cc^aaacfioc fLaagcx^tigi: L^tsbgaosizia g^cgpaacx ^Ddo 
tttgttatta irmrry^^^ aoocgaaaca actDgaaacaa caixaflgoqic a tgaogtatca SIQO 
acaacteatJjt oactoEiJCOC?g ttttactjoac tjgpgttaact cacCflQgtgFR aanotjcctjct 5160 
l-xstffbnctaa <.x;g«batttoa acaaai53wcei ttaacasacaj? aactatcggc l-nrogctotsT 5220 
aatx^ttgatft ASftatctfitCt E3»sajb«5Jcc DWOT.ttjcaq!3 caaaaactg& C-ttebct^c 5280 
trorcatoba bcgaceasoct isttgragCgg attaacstct caogtcaatt OTaoacctct &3dO 
cCTtoonggoft bttocsiioact oacteaagta ttgaoogcag aacctcw^ taoctatacc 5dO0 
cwabogg^sSrii aqspcrgc^bgc gatattazicc qccq ^ gctgg acacccaeaa gactaaocacc 5d60 
ctacat^cgt ccctggfebgft 9tetc9:?fsgt. gctijuyttasi paacBtacLB fcatbtattct 552D 
cataaccaaa aaj^tcG^a agcaagaaj=iA ■r;atHftoagt^w ttoHS^cog tgat^^tcCi^ 
tatc^taoc tatcgatcga taaccaagtt. tcca^-r^eucsa tteuiziactac agsg&t0^c?t 5^0 
Dztaactatog ctagtatcca actgLstatt aarxJ^ogcet tooMoatat gg^gf^^t 5700 
DuctjgtCTCvsa DtatCECcag cc^Lcattc ttisaocat^t gg^taaata caatfia^i^ 5760 
fcaoei®c5CK7tt. oooooootat. gi^ctaaactc cuttactar.c cascOTatta cBtqgpaLcogr 5^:?0 
Aogcft^O^Cd UU^ja^^agnc lusaaats^'g j^at^cgttgic tgcafttooat cagcoaaagc Ski5o 
caattaaata togataccgb ai9c»goLtocc tttskaatctt acctafiu2i£ste oiftbcgBacaQ) &!^40 
QtOf^otaatc tciqaaacocc cagcgtxl-^tr^^ OAtgacaoca ctinataaCiga tdKngg3it.ta 6D0& 
ac^tacctta tcqi^gtag taaaac«9$a 3t)€Aeuitc»3it attattjggcg C3s(tgtgro& 6D6D 
cacaataaat csca^ogaa^ taaattx^cco ^cboatooct g^gt^agtg Qbocashaatit &L20 
gactc^ccaa tt:aacciDCLa rftftaqpAtaot ^^bteaDOogc taattititcca j«»:JctogcctjC7 6180 
tatcbbatet ggctjooaaca aaaaaa^ggcg actaas^c^gs ^lauggti^acaa aBC3;;gccl>;7g &2^0 
gcfttAUatt ttbgiwict'jaa attajjo^c^-t sCccgttatg acggcaoctg gaata c Qiar:w &300 
gtC^OCttte AtgbcMiaoca E^aaatatcc gatttaaatr: t^^goMtaa aaoacsctg^ 
cuttaccgct caagcctt^ca aggcaigogat daatto^tfs; t^tgtLtta teeuObftftapeisi 6420 
g»tcaattaa 4U3S>tiltao»c Asaacacagta ccaataaaag gioc^^aLtatat oocttccaac S^QO 
at^l-jCUtK^Iss^ {nggoeizitozic aGCtj»aaat cacaaaccta arJ9c;tt3itaa acaigtCtQat 65^0 
ai?tAaeft5l-e Iv'i^ltg^tgt <:auitaaCcoc tatgcaqaaa g^^ztacgnnat cccttcatca 6600 
gtaaatagta ataacggtta \:^^tt(gqg^< gatgcccatc ti^agtfitggl-. 9tot03C032i G660 
aatacctxag ccatcaaa:^ OT^Stcctca tcaejat^ast tosaactctc acoaaggULii 6730 
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€^aattat.tc aCjaabggact tntaj^nccpa caKMigioaacc aatQcaacct gatigaai^iaa 6780 
toicCTt^sag^ ttggtgataa atttattiztt ijatact^ctc tac^cstatitaa ccocaatioat 
ttetogaat«k t^AAaattcat. ttaocctigitt tDtcobotatza gt^^fgaajcac taccastaaH 690D 
QnooMSSaG gfj^tgpctgtt Ctatci^cTaa agtacCacta actttjg£Laag agcctggttc 6S60 
ccdnEiccttr; cctctggctc tCBagaaatR tcxacaaoca cLggoggtga cattagtggt 7O20 
<i^:;«;-jtil:^cr- aLatt^tsa coaaERtoct gi3uog3itct.tc oatttaaaca atatttctat 70S0 
atggatgacc acuyLguLnt tgewscwtsrtl. gt-.tlx^ygg^i tattatctat teisfctacc^c 7i40 
attaatcatt coDjCwgti-fitt t^Uwit^gtg ftaagcc^aay ^tetcnosoa nmauictttt 7200 
rrtaQosniE^ nstfi*£i<s^l-eft ugtxxcri^cc aat£cg$xtx: ^(ctt:cyww«i!i aetCoaistitiit 7260 
C!5Qftt.tj»»^ fl^c<--t^siaat. aeatatcccc acscLgacal. t;l;catii«abm tganocaoct 7320 
t^tbgsitoUAi ocCttBtcigc attcggctgag aaBtttc^og etiaat^i^aift bgntcgtzKoc 7300 
U:4$gi&O^^Bg aacBitttcBg tattogtatt atcaaa^ag oggBfiactg^ Uuitgccctg 7440 
accctgcaxx: ataatgcE^ caassomoBta tatatccaqt ggj^^aacte ttgLatLtjOgLt 7!iQ0 
cttaetacsgc: Cattt43CDC!s tcaattoat^t agccssaocco ac^cggg^t a^tactatt. 7560 
ttj^gcatgg aoactragaa tattcai^cjaa octiazkattag gajgaagattc toctg&tgct. 762D 
atfl^aocaa togacttcaa c sagoqcoaac aacctctatt tctggg&aact gttctactac 7660 
aoQCC^t^ t^tto^too ziogbttQctg cacgaaimga acc&o^tga ggctsaccgx. 7740 
tOTctgaewt rtbatcl:^!^ cxxaSLtirogst tAtattstrai aCosstoaaat qcaacattsc 7B00 
<7^<fJS^lig Lccageccatc fsCAiij^g^ flcLEtgrJi^gfl iscgatgatcc ottosjattca 7B60 
tttssatcrtg ataccatagc tcaacatgat aeaatgnact. acafidgtatju cw^ctttata 7920 
cgoacoETtag atctgttgat cgaecgiggga gattaogcct atwgccauSitt gDogciOTZic 7980 
acactcgctg aagccaaaat gtggcatatg cagigpcactgc:: at;iC:t»ttg9g twtaaacct 8O40 
cfl&t^Ttaocac tcagttcacjc atggaarjgat- ccaeagrrr^ Aeyyagcoogc DQCtcttgaa 3100 
A&aaaficayg cacatgccaaa ft^&ti^»e^ ^r.nt-ac^ ac^ggncttcc tacatccaoa 8160 
g^igtctaaa^ atjja^tjraa aaca^tctt ttoccgocgc e^i^j:;^;^, ogti^atucta ^220 
ajDtozrtgr^ eigwasictta^ araaoggtug tsL^cctgtr gccataacKr. ctt^tafcteat 1^2^ 
ggtx^cctt cat^ttt^c tattttqg^^a ^r^CB^g ^Ux^6^6^aasc ^jctgctcasjc B340 
cjrc^:r.gt£5g <:cBgttri»s^;(iiL eiggbcjasjwwjt BwnbtttCcat CTj5D»tttat atcagtgtigg B400 
ygtbbcc;CU; etoiigi^tygn MBuogcoosjt aotatOTtca gtcagctaac ccaattoggr: ftdfiO 
tct^acatt^ uinant^bbat. cgi3»>cgtcna oatijogagajji? cantaaacac 9ctigr.tjgcBa BS2D 
wit^ggi;:?^ c^^t^tg^t attgiicoasit ctragcatDC ai^caaaac catccsagaci B&BO 
cbggatQcC;!) AMi»^y\-^t ^^tasaastcm ^auCx^cgi^ saakcccaaitc ^cgttLtgal: 8640 
^sactao3igi;% ^tt<;t.aic^ t^ftasacsatx; ^^s^gof^i)^ cwiaaacaagc aatOTJ=gtt^ B700 
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c?jtf3ctccca iKgctggcBX, ctctaca^jDC cttccbogcat. ockcatctrgi^ gggogcBigoa. 
ctt^jatctns 03cCX3*ciat cttcggcttc actraatgsjttj gDaacegttjg gggggcg^txt RSSD 
ci^caa^Dcs ostggt^tgi: mtggagttc tcociccai^tg tcatgaacac fis^tiu^^ftag^l. e$(<t) 
da^tcB^ datct^aigL^ L^^cc^cpg c^cgtcagg aati^j^aat: i-j^c6i'a«»l: ^940 
^-^O'lXJasiOT csnssgciigaa acaaatcgat ssctcaacttR gttcgctggc agugti^citrgi: 
g:vx9009cv^ tattigcHc^a Mocasjccta cw»aoc?C2inc iWOTSSCJOoac tcat^cacaa §060 
irtgA'Cscttcc tjgaaacBtaa gctoagtcwit cc»a^3gic?teu Sotocbgnct gcgtggtocja 9120 
ti:el>CC!efcciai tttacttcca ottctatoat btaua oyuL ag- ccc^jctgttt gatggcggaa 91SD 
Ai^^cctAtc gccg^^gac tBhaaaKtaoc gDabcooocLt btaxtcaaaoc c^^cgoct^ 9240 
caggqaaccc atpoocjsrttt WJtceoggst s(tewc*;tfcAEi fc©ctgDi*tct s^gcaca^tg 92DQ 
g^catgccc accb g aaoca yant^^iat:^ gLacrtggaeeg t^*9ttfl.c5gtac cscjtttogcijg ^>i60 
□OTO±>3tct. atec?Ott^<;t- acrcagcaai^!: aaatttatec tgactcaggst JXito^^ss&^ S420 
ttOTtgagta. ?i,s.ggttcags cagggroggc aag^raat^ etaagctgfl^ ^ttagtecc 9480 
c«t>acciSAl^& occcteta^a agogtocatt togttatcta ccimgBac&l: taigcaaogat 9S40 
t--acwt;^tt ctattggtaa asccc^o^ attaaaoaga Ci^gcrgtLBcr wt-groagra 9600 
■ccgctag^ac cdatca^^ga t^^raaoca Rttctotctt aoago^gaaa agcctX!l^gft?=i 9660 
tt.c?;?::Djaaa jjtl>y:a»acc attwjtyyUt bctcabggsa tgaiatoacai; o^tjc&gtt^c ^720 
CTact-gpatt tiCi^^cc^l^ s';aaali1:i^ts cqgjtLccf^ gcja.tciwiiaiit coatcpaOTt ^780 
iic^::tg&£ar. t^gcttccx: afLStgcaatt agtaaa^^tag a^raaaaa^ugrt cgiaiaaaasKIc 9adD 
Asacaacaacj ccatgctg^ ^agtctgaac gacatosttc tgcstattog ctacaooatt 9M0 
ogccaat^ac ^ttttaatc aagtgctaaa aoaggoocct. aagcgigiggcx; tijcaiasKJaat 9960 
ctttcatgcs aaattcacaa ^tttcagta ttacajg^act atjcatt^eycc aicwOTatjgag 10D20 
gcgcLBtcac gigg^cggggf c^agctttaa ccccc&ccgg gccgcfeXj^ Bfetggcogcsjc 1D080 
tgtctctgcc ^ttgcctatc tctgccOTQPc- qcq^tatjcjc tccgtcactc ©ccttaaact 10140 
hcaaca^cgg ogccqigcaac a^jcccatctc atctgqj^tg OTattgeaac gCt;«t9?«^Jutt LOZOO 
tjixigccgpcag cacccetttt ejgcgttccac attacgatga aaaa^teicc tttctcnroc 102GD 
cfloatQecg^i ^gcacoggtor ebewiggatf: aatcccsgogia csaatcoaca ttacagggta 10320 
toaactt«og caw^ccttt Efccroscttarog Ejdktwwsgttc ccsCctiics*Q aj;tcatttca lOC^BO 
C?ocgattQe>a atattggoewi occaoggucM^ ot^wxicbognc ^usucbggcem acagattttt lOddO 
ggctigiiwtii (lai/cticiei^t: ggracssagtacr atttact^ tW*to»oca coagcooogK 10500 
Lcegeaacoc gt£aagacaCc actjcaaacag cacaatggtt gctaga&gccr tetjgtwcac 10560 
caratOTtgA dC&afttttat tatcaatatc ggig?c3ogBgga Ujaoatroggt tqcijaagctg 10620 
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Ab»rtrtfttt*c tctccatjcca CBggizcgccg cagcaacsgtta tetaci&oacft gtgtatbacg lO&eO 
9C5,aacac!gg»: Bgncagraea afigttafaocg gtci^gatgg cagci^c^ ccejuocwrgcoQ 1C740 
AcCggtr^tt ctatctggtja ttt^ttacrg gcTgafisizgic^^ tfijat^cscKxtg nOTMK?slccac lOBOO 
cagc--&ntttc gactacaggt a^tggcttr. gtog2c^^ ocp^ttbttrc cjTttati^t 
ate^g^ttttga Qatt£3gtaac o^^ccgcttat gosgtcaggt sttttjfttiji^t ncrircsicctgti L092O 
aa£5Gtctai5i tr^i^D&asjata ixuMssstzsno^ acooaocaac gtrtggtttica ca^cli^Lac iQg^jo 
tc«satt«4t:^ crgpft^^jat^^^ «U;9oca0:?a t^gctgstatt c!rttcgtoga ^ggacac^ iKidD 
y^-vt^asc^ t-Jict^rogUi acttityuogti ocsttageatt googtatcag ^latttttcac 111{)0 
c^^OAiJXSAl-fiiei cgctc^tse casatogftt^ ^bgtoctoBc aaacttcaat gccattcagc 11160 
ecbeSCftft^l; flett9i&l'<?tA aacjuggiisoas gattcoccsso tctoctatat caa^taaag 
gflgrotggtg gtaccgctcic gracftaogtt t.'S;:»:jete«wkt tBoctoagat qDogtcactt llifiO 
gggaaaaaat gcaaoctttg toggttatcc ctrjflctteHi iawagtMCouc to^^ctggtgg llidQ 
aCatoaatOT E^garwxaa ctLgactgigBg ttatca£X!^ acfC^ggatte csssjggatEttc 1140a 
atai?tMgca tccBQat^gc sgttggacac gfttttacsoccf gcteewicsct ctQccagtgg 11^60 
aatatacLTO ccraogogog caactcgocsg atttaatggg agcbggacbt tctnatttac 11520 
tactigfatcTO ccctaziisagt gtaogtttat atigcoaatac ciCig(»9RM?9gc tttgccaaaQ 
^ftEwaatgt ^igrtgcaaccc gotgatatca cantgaocagt ataoggigygcftcs geat ujyUijL o 11C40 
MQtf^liggc atttflijbgnt gtwtcnoott. ocsnjtcai;!^ acBtctggtt gaaglu^dgte """'OO 
ogactaaagt Gacctgctg© vAaatcdgs ggcoosigaoa tttcggteaa ccaattactc il7fJii 
ttcqg^ct tagircaaaccfi gajfi^ceftCgt tt3«bcctDc toaajjtttat ctggcsogatc 1IR20 
tagatggcau oOTcocgact s^tctgatr.r. ^X^tbCmx!^ Dugatootct^ gatstcctcx: lieso 
tqaataaaag cggcaacggc r.txjgtscjgcac caagtMctct coccttcccs gcog^gtgc: 11940 
qttttoatca taoctgtaag ttacaaagteg cir^t^rteoa aogcttai^ gtxx^cca^cc 120DO 
bgatett^uis t^toccgcat Btgactccoc atcactg^ tbgc^atctg aoosacacaa 12060 
afl£Mgtggrtt. acrti^agt^cKi cCgiaacEiiira atatgrajggc tcatcacs^^'ir; rj^tjgogt-tacc 12120 
gt^^tccagc CC>agCl:cbgn ctgn^otgasaa sagnca^c actggat^cr SESac^^iiKbarj L21S0 
cagtttgttB tctacccttc ccggtafi&Cft cccstAtogca onoosjaaata ^aggatgaaa 1224{i 
t«^cggc^9 ca^att^tc ac a atact e i D gttatgcacs tggcgccbgg gaLg^jgi^j i;?3<?0 
CT^C5gas5L&J;L to^CQsattt quttat^sttR aaca^aaaj^ cagccat/cafl f.^jgiyiwaiay 173fiO 
iJ^agptsogcc ft^twojDR cax?ciOT3ic toaocxaaisg caacgcgcct ciaacfccacat. 1J4SD 
cacoD^opcc esacccaaggc aAii9e>t<K;:atj aai^ixviraDc iiccCiic^aCg fxxcaaaqca 12^ BO 
jssagcgcjcbga acteacatGa cTOgciSp3i:£[>i crxsaa^t^sa <:53c?gcca«aa ttcacatcac 12540 
oogc^tggr ocadg^oa&t go^cagaac tc;ac«K:cajcc t^cgatgacc aaaaactg^ 12 £00 
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atgzcaoDOT aatcicccatcr ec^isataaoa etobbato^ac aonntattgg cat^gcgatf; i:^650 
aocsagcttt tgoogfjtttt tcoocac^t. tt^^cgzioctg gcaagatggt cae*0^t;aCl:c: 12720 
t^tcaracc g^aaat^t aac£*gtcs:sat nctoiiutjaaa cc^ggcactg ai^n^sgfccBbOD L2730 
tgcLEiE^cfiig tj^»act53Lac gopanaosta ocasTtaoaca ggasaaaart ooctacaozm 12840 
tcaccgaatt topcccacac gtaogtoggt teicaocatac cgatagooga Uacctti^tgc 129CO 
tttggtcatc tot«igttgaj» 43gi3cgia«!ict ^tcattocgn acotatogcc agc^^^cr.^ I29<90 
oa&guasc7C£i ^unn^atbHcag n-jil;cc^C9 ^Lxrtabb^;^ tcoaco^cta aaax2aggtt;;t 
o93taiji3ttgtii toc.^tswL:ca&t^ c=%j^sCCit(Tg oaae^sftSftoc fftaccctgat acgtt^cscte 13OB0 
cfltsiagtbatt UjCt«>e»o^- l^^ttr^Jit^fiacc! jfSifliaajeKaijCM attaoggrtc arotAtCstoc 1314D 
<?9ttwagtts c^U^aCcl^g aQ2:gi3*Geiat& ccatbctsot ntta^gatts cwggttteigto 132D0 
ccogcagcg&es tatctttgct tatagcgctg aacabgtorx: ttfiotggtOTt cLaaaljctSQ 132G0 
aaatcctaaa tgataaa£4at agtct^jattg q^^gaataa acclu^gtoaa tacctoggoc 
g0nfv\iianhaq cgttbatocc ^sajoggscene^ atgcaaoc9CC atcgcaa^o^ poaacact^c l^^BQ 
wnQogutgat t^oCtti^eiCfj gagacaiacag tatttaatca a^iacacBCtA b^;aj0Giatt.t.g l^ddO 
&l'^grg)t^UAt cUa^tOt^ct- c^tt$bcxl^ ogsuctictigga arwKjcogafft boccaocaaa 13500 
cagattabct attccacagcgpc acbgg^fliftg aboEiaiBtctc g^oagctcgt. ogtroctsta 13E560 
ctgattaagg aacBQOogae cagtteCggc 99(C*5©oflK«ra acagagcaac actosoctcn 13620 
ogggcai^t CBcgctcacL t^atgcaa artar.r^cacrt cetGacscaa acocsggsate 13660 
oSKicto33Ct o^KDoaactca ^ocag^tatug att^e^ogr-tJ: t^c^bgaccccc gttcaaiuLir^ "l3"7-40 
oggatisccan. ogacc^toag ceccttaoia cgirLg^f^: ^^ctggoac^ ccaatc&rrac l^-BOn 
tgrorttt-tES ^ULtacc;^^ aacggraaga tgajczLggrtli/s t-tcttoarog ga&aaa^l:^!.. 13^50 
ctrttttctcc oicratct^c gct^^scgirag cgatLaa^tu ^MraiaagDCB atjocct^t^jg 13S20 
caraototca ggtctacgca cccgaaagcL ggatgcocsAb attaaagaaa aiiaciClWA^lfca 13960 
acxTtogriaoa graa^jaf^ogg aaagagtt^t at^acBOCX^ cwitostcaoc gaagfatiagicf)^; 1.4D40 
gc^bctgtuac cctogutcac cqocsgcfcgOT rffrW^afynra aagrtgcziEjtc aoccagccaa l4lU0 
tcaatctctc euAoggcagt ccoogttta:^ cooctcatag csOtoacattg actacggatc 14160 
gttadgaccg cgatcttaag caacagattc gfjCSfiicttiagt iasjtattcegt SAtggctttg 14220 
gcogtutact gcaagcA&ct gtrtDgilCAbg tua^m^oga agcctiggcAfii cjgtanocaao 14280 
hcggpgctct ggt^caaaa aligywis^it/? rxjaoanocjcg ctgggcggl'.u noaoasKmca 14340 
ctigaatatiga ceataaggga caacciyatat ecsttcctatca accctattto cbcnaognct 14400 
ggcaataogt cagtaatgac agtgciooggc 9^ca@gbstoBi c^goccatgoa ^(stTuxTcatg 14460 
lictat^tcc cattggtcga gaaatt^d^ iX3Cteocgc aaaaggcngff tbcxjguosaa 14520 
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<xttttstfcet(^ tjocttggctt: actgtcaatg aag^bgeia^^ tpekcoioe^gut cMTt^ggL^a lASBd 
ja^gCaafQ^R yF^acfiftF^tugt aaagaaggtia aagEKsggt^ agatsbsiatt. tf^atcaatcc 14£d& 
ogcccaggCt:gF Aagggoggga aacacaacat BatatagAd? toaziaoatgt catrratmt 14700 
gcrrsfrj«&gi9l: actcaaactta tQagttjggT.r. gatcattggt tttatt^jc^ cctgggiQfogg 14760 
Bttaeta^gg tacctjcattg atatacaaas c&aacsaabgt nMtoaaatil: ggatcaaogt 14fl20 
Eurtctgtma ctcattatet cct-grttttBC cggtatauij^ ^gaEjgstctijc tgajgttttga 14£jaO 
aaijQ^^jcgrjc agcooctata toactttt;^ gaittgooggg cLAttb^goa cccocjg^jaai; 149dO 
ttct^gactj^ i^icLcjgatGt googtogoct ttttatgcat tatCJ^^sata atg^ggmR IBQOO 
gLraat3aggc attoccactg ocgcaaaaac catctgtcte ogsoasittM acogggaaat. l.^d^^J 
taoztactac aactattjcat:a fSQaaaacgaa tatatajgew^ oeucl^ekzicatg cagataaaaa ).5I20 
ct{^D^t.t{9C? cioaococata anRmraaog ccccBAcaar!: goggtaziavLc atg l?)17f) 

1 

aac atfi cact aaa cr.t tat. cca ziasT acc cct gtc gtc aati «tc t£»c 15224 
Aan lie abip Pro Leu lyr Gin lya aSicr Pro V^L Abtj rl& Tyir 
5 LO 15 

aac <:g?L g^c eta &2g ato Cgt zfes^c ate sjac ttt cac cgt. ^^^x z*cc 15272 
Asi;:* Asn Arg Gl^^ Leu <itir XLe Jirg Aea Jtep Hia Ax^ Thr Ihr 
20 25 20 

aac ggc acc ate cert Att «K?t. coc cac caa tat gac L^iv 15320 
Ala 7ifia:i cTLy Tla? II© Am Lie Tfir Arg Citfi ain Tf^ Asp Ser 
35 40 45 aO 

ctt ggg car^ -^jji aav cua crgu acc gat ccg ogt cl^ tat gna qdc aaa lii6» 
Leu Olv^ Hie Leu 5er Gin Ser Thr A^ Pro kr3 Leu Tyr Calu ?Qa D/r; 

5& 60 « 

Dbaa tct aac ttt etc tgg c«e t?it gnt ttg acc ggt aal: (>tt tto 15416 
fSLi Lys ^ Aso ptjc Lbu iTp Gin lyr A^ l^u Thr Gly Asn lie 
70 75 fiD 

cgt suoa siaa age gtc gat get gigt ant gtc acc ttg aat att 1646^ 
cys ate <3au S^jf Vta.L tep Ala dy Arg Oia: val Hir Lou Aan ftsp He 
S5 go 95 

gzia roc C57t CTO eta ctg afs ^ gro ««a osit gtc ata awj L5512 
du Gly Acg Pra Urnj Jxw Ihr vaL Thr JQa Gly Wl TLq Oln Ota: 

LDO lOiE LIO 

cga caa tat ^ a£3g tnt Uac cba cxc g!?t ogt ctg ttg tct gtb acc L5560 
Pa'o Qin Tyx OIu Thr Ser E€r Jjei] Pr.'n Gty Ars Leu Leu VaL Ite* 
115 3:10 125 13 D 

gaa caa ata cca gaa ^auu aca toe cgt ate acc gaa OgC eta att ti^g 1560j% 
Glu Gin lie Pro Glii iJi?s Ttor geir Arg Ilo Ihr Glu Aig usta IIh Tj^? 

•J 35 IdO 14S 

get ggc aat age gaa gca grjtg ^ma aac cat aat ctt gcc caJig tgc 15656 
Ala fily ©car GQw Ala du L^ Aen Kifs Ai^i Aia Ser di:! cys 
150 155 16D 

gtg ogc cac tat gac acg ^cg ggct gtc ooc cga tta gag egl: tbg tea 15701 
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V&l Arg Kis Tyr Asp Ihr Aia GDiy IfeQ "fhr Aaca Leu Giu Ser Leu Ser 
1€6 170 375 

crtg accr fi^t. BiTt gtt tta tct caa tec age caa Cta tts OQC: wac act 15752 
Lau Khr <Ilv Ihr Vhl L<:u iScr CQn Ser £!er Lgu Leu 5er A2(p Ttir 
180 185 190 

oas gBa get agjc tgg aca ggt gat aat gaa acu cm aac ac^ 1580D 

ULn Glu Ala Scr Trp TVnr GQi' fep Asn Ttr V&l Ti:p CSLtj A5;fi 
IdB 2O0 205 310 

Gtg gat i^kc ate tac ara acc ctg agsc goc Ctl. gat gcc ^ucc ?roc l!58<ie 
1^1 JU$ Asp Asp iLe lyr 'rtir iSir Lsru 8ct Mci Ite AEp Ala Ttir Giy 
215 220 22 & 

gt?t tta etc Ajt?t gCsg cw«i s?ot aac att cag agg ct4^ wtx; 1SB96 

Ala Lera Lso !3lir (Sin Thr Aep Ala Lya Gly Ilo dn Atg Leu Tto 
:^30 235 24Q 

tat gat gtg gcc ggs cag cCa aac CP:^^ tjtjg tta acc tta Ma gac? 1S944 
IVr Asp Vol Ala GIv Gin Leu As:k Gly Ser Tip Tj^j Hit Leu 1^ Asp 
245 2S0 255 

caa <xg ^ caa gtjg att. ate aga tec cCS ^ksc tat toe gcc gcc gga 1.5992 
oLfi prti Qiu <al« w). Jle Jlje Ars fer Leu Hil- Tyr £tet ADti Aln. Gly 
260 2eS 270 

coo aas tea cs^ D3»a iia2fs cac aat ggt gtt ate acc aaa tac sgt IfiOdO 
Gin i£u Arg <5L^) Glv^ Hi^ Oly ten Gly val lie Thr Qlu Tyr Ser 
275 280 2g5 290 

bnfc wao ccg oasi wx: cm oag ctt ate giggt acc dma «og cac cgt ccg ifiOfift 
Tyr Glu Pro Glu Thr Gbi <5bx l^efu II-c Gly 'Ibr li^ Thr lli^ Arg Pro 
295 300 30S 

tea gat fifia^ ^ta^L gts tbg ooa osit eta ogt tat ga^ Ust ^ ccg gta 16136 
Ser Aerp Ala L&re val Leu Gin Aep Leu Ary^ Tyr Glu Tyr Prej Vfei 
MO 3.15 .i:jo 

^Kjc aat gtc ate agt acc cgt aat gfto ©ca c^so acc cgj: lite tsa ItlB-a 
Gly Afln ViftJ. IJ^S Ster lie Arc Aai A^ Ala Glu AUSi Tte Arg Phe Trp 
325 330 335 

cac aat osg aaa gtg gcg ccg gaa one oct tat aoc cac gsKi; to?3 ttg 16232 
KlB Asti Oln l/yTS Vlia Ala l?ro Glu Aro Thr Tyv "nir TVr Aap Sfer Leu 
3dO 3^5 350 

t«t OTS etc ate ai?c gca acc ggg qgw gag otg cog aat ata ggt r^og 16280 
Tys: Glii Leu lie Sesr itic Gly Arj? Glu rfct Aia A^jcj Hb Gly GLn 
355 360 :ifi5 

caa. agt wc= CiawSi ctt coc tec etc acc eta ccx tct sjat aac acc 16328 
aLn ser Aen GLfi Leu Pm S^a" 3Le« Thr Leu Pro 5er Assn Asn Thr 
375 iSiJO 2fi& 

tac aor aac tat aoc cgt act taU act cat gac cgt g^c twit ttcr 1C37S 
TVr Thr Am Tyr ato Arg Tto Tyr T^r 7vx" Asp Arg Gly Gly Asn 
.^90 355 400 

act aaa ate cag uoc^ agt too cog sjog acg caa aac ifhtvo Cijc mc aca 16424 
ihr lie G^ Hie Sar Ser Fro Ala Otir CQn Aa> Aan lyr Thr TY^ 
AOS 410 415 

aac ate ac^ get tct aac ogg age aat cage goa gta etc age act ctg 1647?. 
Aisfi rle Hv^ J5er Am Ser Asn Area Ala vial Lot Sbr ITir Ueu 
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420 425 430 

acc gaa ccg gcg can qta gsit get tto tt-t, gut. gca ggc gga oac i€52D 
Thr Cau Asp P3rt> Ala Gifi ^1 Aap Als Leu I*ie Ala CELy Qiy Eiis 
44Q ^45 

<:3g acg ntg ata tea gga csii bx ctug aac t^g aat bca og(c g^L' ^65(^8 
Gin ££01 nhr Lbu Zls Ste CQy CUji A:&at I«c?u Astl Trp Asn 'Jhr Ar? Gly 
d5S 460 

cLa COB cat gtg acs ttg glucr aaa ogg gac sag ggp gcc aftt gat ,1^616 
cau Leu Qln His VHL Tte Tjjw V*) Awg Asp La^ Gly Ala Aan ABp 
no 475 ^i8D 

gat. ogg gas tcss tst cs9c tat ;;»gt ^ac gga etga a9g ats tt^ aaa 16^6^ 
Asp A^g aiv Trp ^lyif Arg ser Aep Gly Arg Ar9 Il-o Ijbq Lys 

4B5 490 495 

ate ^t goa cag osur occ ogc ogc ^isic tct cas 2i!ca cag aga ate act 1.6'?^^ 
iLe Aai Glu GLn Gin Tiir Ser Ser Asn Ser Gljrt nhi: Gin Arg lie Itio: 
500 505 510 

tat ttg ccg ogc^ bte am ct^t est eta wxi vosl aac age acg estc ac^ 3l4'7&D 
Tyr Leu Fxo SET Leu Glu Leu Atg lisa Thr Gin ££n Sii^r Ihr He Tlir 
515 520 555 SaO 

acc= gaa giat ttq caa gtt ate aca gta gga ^aa ©cig ggt cgc i^ca 16608 
Ohr GLu Asp Leu Gin VkL lie Thr Val Oly Qlu Ala <ny Arg Ale Gin 
535 54G 545 

gtfl oaa c?ta tt£;i cat tgcr aac «Krt caa ccg gaa gst iitc gzic aat 1&B5& 
Arg val Leu i^er Gly GlLi::i f?xx> tSlu As(p lie Asp ABn 

550 555 560 

a&t crag eta cgt tal: ago lAc? gftt ai^b ctt ate am: tec agt ctt 16904 
Asm Oln Leu Ax^ iVr aer Tyr Aep Asn Leu lie Gly .9er GLii Ijeu 
565 570 575 

^a tta gac age aaa gga gaa att att agt gag yti^ gzig tac tat ccc 16952 
gLu Leu Afip Sfrr Li^ aiy GQu Jl<^ lie Scr Glu <j1u Glu Tyr ayr Pix» 
560 Gas S^l) 

tac ggc ac^ gca tta tg^ gca aca agg a^g <^ am gaa gpc acft 1700D 
7yr Gly Gly Thr; Aid LtM Trp Ala Tfur Ara Lyb Arg Hir cSu Ala Sor 
b9S 600 610 

tat aaa acc ate cgt tat tea ggt aaa gag egg edt gcc acc gga eta 1704ft 
LyS Tte lie Arg Tyx Sfex Oly lys Glu Arg Aep AIa Thr Gly Leu 

615 620 62^2 

tat tat tac ggt tar caga tat tat ca^ cct tgg gtw rjga cisa tgg tta 17096 
Tytr Tyir Tyr <5ly TVi' Ara Tyr Tyr <51ti Itp V&l Gly A±>s Tip Lew 
63D 635 640 

ai^ qcc ^t oog gca gga aca gta gat ggg t.t$ tta tat egg atg i7Ud 
Ssr Alft Asip Pro Ala Gly 'fhr Val Asp Gly Leii Aan Ij^xi lyr Anj Met 
&d5 tf>SO 655 

gta agg aat aal; ixjg gbt cw^t ctg ctt gat cct gat gga tua atg oca 17192 
Val Arg Aan Aen Ftd ^^1 Thr L€i^ 1^ Aep E>rr> Ai^p Gly Leu Met Rtd 
660 665 670 

aca att gca ^a age ata gca gca ctg caa aws ant sBa Qta gi:?4i grtt 
Ttxr lie Aid Glu Ai^ 11^ Ala Al« Jj^m OIji Uis Aei^ D>^; Vi*! Alofc Asp 
£75 660 6$b 690 
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9og cct teg cca ewxi cwit s^c gqg a^c AUt cgc 172BB 

Ser ALa Pc? Pro Hu: Aen Ala Hhx Vdl Ala He Psa He Aro 
695 "700 VOb 



caog ccc gta Qoa roa octts i:r:t ^cnc: tta ooc aaa gca tea aog agt age 17336 
Pro l*ro S^J ?!la pro Lye Pro ttac Leu i^o Lya Ala Ser 'l"hr Ser Sfer 
710 -715 7:iD 

caa tx:a act ctca tac oct? nU; tr:l: gca agcr ^L-^ aaa CH^a Acig^ 173&d 
Oto Scr Tht* Thr oyr Pro iLe Lye Sear Ala Her lie Dsfb Pro Ihr 'Ilir 
725 7^0 

tcc5 oga too tx::^.: mLL ftr-;i: gcn cca ctg agt oce gts gig» aat aa^ tcrt 17^12 
Ser <^y Ser Sar 11© ihr Ala T^i S^r- Pjcv? ASai L^rs ft?*" 

740 7dS 750 

crct gaa sta tct ctt oca ptus ^isjc oct- cxta aoc sat tct tea sugc 17460 
rrtir ECO Glu ric5 Ixiu PJtti <5bi Tiu: Gin Ser Asr^ Sec S?ir Se^r 
755 76& 7G5 770 

gt^t *t.L t£^ acB aat eta cag »a ss^ tea ttt act tta tat aga gcg 1752B 
Ala xLe 3ar Ttir Asn Lcsu Gdii Lyii Ly? Sear lt«! Thr Leu T^ft Ats AIr 
775 7BD 7SS 

gat aat a^a ticc ttt gac ate cac ogc obcba ttc cct ttt 17576 

itss> A^rg Ser Phe GLu Asp Ust gIji Ser Lye Ph& Fro GLu Gly PJ^ 
79& 795 BOO 

aaa npc tss act cct otd oat scU a^g atg gci^ ^ggj <:as ttt get «gt I7fi24 
Ala 'Xrp Ohr 9ro t&2 Aep Thr 1^ VktL Ala An? Gin Ala seu 
©05 Bid * 815 

gtc ttt aUt ggt ca&g eaiSi gaU tct a&t tti cct aa« ^fsw^ cx>^ ^rlx: nei72 
V&l flrjB JlB GIv GLn Lya Asp thr Ser Asn Lou Pro Glu ihr val 
820 S«Z5 8^0 

£ag aat ata aac aca tgg gf^ aca aaa cca tta aat ^t etc tea 17720 
li/3 Aot A.srt 'r>3fT- Trp Qly Thx- Li« Pro Dy^^r I^j A^srt Ayp Iktii Ssr 
635 B40 645 B50 

act toe ate aoa tat acc aasr oac cw« tct aca gta toe <3tc tct stct 17768 
T^LT? Tyr He L^s lyr Tlir lyc Asp LyE Ser Thr vial 7*1:1 ^Ail /5er.' 'Rir.' 

8S5 S&O 665 

STca att cMitt sucb gem gois gut goa coa ogt t03k cn^g get dcq etc cat 17316 
Alfi llet Aan tfur Glu Ala <^y C^y Gl» Sex Ser Gly Ala Pro 
er70 875 860 

octti titl aat atg gat CU'«: UaU ttl: acre atlL gac. 99$ htcw mcj eta 17664 
GLu lie Asn )4et Asp Leu lyr Glu 7hr lie Aep Gly Lys Leu 
»«5 l$9n H35 

eiat oca cl;-a caca ac^ yija aga l^- asb gfar: isgg gt^ cri. tea eta l-t'A l79l2 
Eto Leu Pro Arg Gly Arg Ser Lys Asp Arg val Pro ser Leu 1^:1^ 
900 905 910 

clU gac aca oca ata aca gca tec ata att gca ctt aat cottz 17960 
Leu A^ aixc Glu He GLu Tlir Al<% Scr lie Tl<? Ala Lju Bi-S 
515 9i0 &25 920 

OTW £X3g gta aat gat gca ^a gtt tcs ttc eta aca aca att org ctt 1600& 
GLy Pro Vfcd Asn Asp Ala Qlv Ifcil S^.r £titt Leii T^ir T.ltt Pjfo Leu 
935 940 945 
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oaa ^t- gtd <^ct. Lat: aag ^ga taa oaaaa£»ttnt atattcTttta lfiD55 
li^ Ann ifel LyB Pro ftnr Lam? Juri^ 

S&d 955 

tctskccttt^en atagfocctct tgaacttaca ctoanaoiggg ogfaaziocasa taa^aaacca 1B1IL5 

tcbttnot^CMi caagcscatDga aagKtal:t.t attttcEitgqc ttjsattactt ttaacattca IBnS 

«)tattefiuiU» fittaaaacaa tatctaacca atta^acaa caatacctta tttzatcatat 18235 

U^8$$t9t£ra aatcaig^aaat: tEatq^buittt aa ukjyL tctt. tatatttatt tctjga^agca 13295 

trt^cacBBt acctta^cocsci tjosjcoctosa cstgattroa aQtcca?3aaa t^ycrtatattt 1B355 

tjcaL-caatat s^^gcaansit^ gcgrattto^ ttyggpistj^tt tbwtiacttat <::0ogcKXicoc 16415- 

gcttttacca csatozjswttn^ iatt?i>9i^ja^ tf^gcfc^s-ocg tcaa^btoucc 1:^3K»ttttgg 16475 

LattcscjEEia oatcwitt/ccws ti^OAt;?i^**a UtAtt^.t-t ^tttmc«>a zjiocsictiKtea 19535 

cgctjcjaacsia azi99n»cuujt <Ajyftl'^r.r^tt. tcarxzagcgl- ^^t.ttct/3a ttetsuKata 13&95 

tratOTgafca ca&gcaagi^ga cBgcatttga taagcaccct cnaCwt^At gfltatoatoa 1S655 

t:toUct9e^t flftnattcagt ogtcacatK actigttatat ocartttcat L&^^aet&fMi 107315 

aataccgcat ctt£$c3Cttat rnnj^rfYitjm attagaaaat tfTttgattat. acaaaCatjog lfi77S 

o^taatQat oziJGgttQ^eic cgictobttt^ CK^gzuDcgteiO eLtArtttatt aacacattct. 18a3£ 

ocacttigtiTc caafcgaccgwg tttgtotcte bbtc^tt^zau tatikat.t.t{:t aogac^ttc ia3;d5 

caa-tt£k&tc?t. csiacttboetg ssbcctbtca tziacaognca s^caactictrCg abatagtgcc ld[^55 

rttocctctt ct>3tgeH3ct-t- DgtpttbtcDc g5b6kgt:cGc;t: oaaacMttg acaooccaca 1$015 

asct^qttmn gtt$AJ&atar^ gngrctgiTba nigt^ggzi^BAg tAJUL Mtaca aai^o^itatcc 1907S 

c3»OTgCitA*CQ lJ9(a&t^(iacl-j.r h^.t-^iUa^JU ^Lixt^l^fW witactttog ttijcCttoaa 19135 

0.$a«iat£tc& Lcacataccc t:crt:tgtt.Ltcr aLtrcfi/^aaa l;A@^Uitt^ ajcofttagaQt^ 1919^5 

atgacaaCQpa tgrtttctact f-f-gcaaU^Ctt tl^^cal.h^ijH ocabgcfftta atogncattc 19255 

«attt<!act* csatcaattQt t?^totW:at UaAtaott^ cooaataatt zxtaaaataaa 19315 

tatAfiiCtt^ ttwttatti:: ct-tscrtflttg ntc-at'sgtM sttttmtca atacctanaq 1S375 

awgt^cpibal.- 1-fet.f.V:l:^l;<:Jt tccssygtHairib::: t-'Jit'^gttt^j 093iJCC35f?roti cqattcttoc 19135 

sj^r.LggeeaLat tcaacccatt 09^x^9 CXiUliU t^lAiAttsiftte oGBSfbgattl: CtE>3catt.t.t 19495 

t^^Ur.trisLcc aacttfig^^^ir. I3l-5*g9rt^t l.l.l,JX3»tttii gcwxjcoijtg ttaacttooa L9555 

ty;tttaej9gi9(i& tar^^taooftA ctbciicx?tbff Idagaatatt ctjjttcajjtjc taccttcoac 19615 

^ctttcazjie -atogg^tzaaa taaccgggc t aaaatcaatx: gtstccBBtg catcaatttr: 1<I67& 

gctgatattt gtcogggcbg cstcattgat aaatgcgatt aaat:qg^.ty cr^^gnatacgg :i9T:^fi 

aatagcBtct ttcactactat gacigs^acatc ggtatiaBct/z actgac&ciaH A9]g(r:r/:ggi:/ir; I979f> 

astcrtocac ttaoatogac cocoaccHtC autfugntsigt ttcgcsctgaa agtaacc^gt 196Bb 

tttcogacc^ gr.t.tl:tfti3l. cta9a^;5tfltt Un^nlAt-w^ gt.tggtacct taaw^ODoga 19915 
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catattg^aa tc?t^*anDgat *tl:JU??aggt>4 Arjcttttgag ataUucroag ^uggntaKate 1?S75 
cggctgcgtg tetttaggt^ C3^crt<v:ttc tacaetaafjc caccgattct cLtteMftey 20035 
aga^gcggc toatttrogt <safipC=^K;a^ tgatettew afjcacaggata agzicogatoft 2DD95 
gtaitttcttcj ftcattt^tiaa t^tftsaeULr. tjcrrtcaiwrc cl^tausogtt otctocatjgt:. 20155 
Rtatccaatt ouuwtctt*?: ctgtttatct atttagaaat frtRt'ftftSgto cgctgatagc 20215 
attoct-tonb aiir.tcaatac ati^txagaa aatjogactcc tcgagLgctfag t1;aU;aatbt 20275 
tigccgOEXsgcf atattcaaoc caatzcgpcgc ctt.tct.tatt agtcaccgtig B02ttn^cca 20335 
bUt;t9gtttc atocagctta ^gcttaaaaa taatztttcxs tttagcrtaczt gggagtta^cg 2CQ'95 
bgnstt^wa aggaqgcatt tttaatactc oaccttgtas gaatattctg tKc^gs:^^ 2<?455 
cttoaacgec ttccaaaaca gggtaaataa ocgggctaaa ateaatcgca UUssuab^tog 20515 
ae^ttttgct aatattcatc tg^ztatgc cattgatsga tgcgattaaa ccyQtbgctQ 20575 
T vtt-.Qrv frp^sr agcatccttc 3oc€^tJ3^ toacatcaigt atiaHCtCBOC gata-^zaa^g ^063^ 
cx:oggt.t£«t tt.to>:ittta c?!tcsboccsu cstroattaaa oairt£*gtttt gcttgaaaat 2G6^b 
£»ci:C3^t(3g ts^tC^CC tUCieMCiUl:itc:#s &«v^?ww^c=cc «ittobasji^ gr^oactttaa 
aaOECGEwat attgSUAtCC «i^^:Sta.iC!L- C&i»g»oacitc cttt^jatatg tsttctgcgg 20BL5 
atauDobgtSfg tliCjggC^tfit. ttcrg^cflgtcta i^joc^.f^lHS^c: c^bgatooat tgattttctt 20675 
(-.a^^gigiigia aagog^tca tttgggtcag cacagccti^B tAtbociOEktc actgacaaiga 2D935 
caaaCaagta ttttttaaiCEt tttatcstsjs; taa^^tttcc tGaatti2Cli;& cograbtotc 20995 
coozLODSitct tcobgbcang ncitagogCtc att^atttcs tcacmaa^ aaci&agar^g 2lQf^?i 
t-£!«!WttSAl<x:l: attaocflcag ataaflAAaci* cxa^Ctlw^tgij cgtoajtast agtgagtbga 21115 
ty0i3&C6f^^ tAcagcagtg ceccvisccatc aabtaigbcoc tttoshataaa ggg^acegBB 21175 
bbtfipSbSieiC;v;t a^gtcatwc gtJCC^r^tta i:d9€fajd<;bt90 attatgatta ttaaatcacc 212:^h 
acC!MiA\:mc^ aagaaa^tt tbc&ttitttV; &a&tbeiC€c» gorabyaatrc gtaagaaagt 21295 
lattevodaaf^. rmtflq.'WRt taattt^cbg tt^a^bCPbAnb ^aaggatoaa BttatrWflM 2125& 
tas<?cc:attt cccagggaoa aoaatiacaca atAtaCa^^ o«ict©3tcc tcacttaatt 2L41S 
IXXib^^tca^ agtai^aaata tcxztgatgec l:jc:^U;a^bg cicataoe^rca atgtcattgg 21d75 
t^bb^tftdti^t acq^cgtb td6taadt;tt Ctbbbstcct boat^^cgag ataoaagtuga 21525 
ttcoccabbb diaiicsis^feabc g^taajRbeiwjg eittccazbskcit agcgocaCat cttecacxzac i2ll75^5 
tcatziattcb citgtnbaaoQ atbggbtaoA tai^aatoDtg Lgtaacagg^ tXat^aa^^ ^11655 
ttatttSLtat caw!»bctab<? aactatctto tatotaottt cacagtcoiia crrx^tAbct; 21715 
OTtsjiccbbco fc^i;SMc5Cft(SiCb9 CCC^tooctoo ^straccttct gat^K:aaca gcTLacaetaa 2i'r/S 
ctatooccgr; gjrtctatfi^tr. atgaocstgU ^«Mfl»btoag cgtagttmcr oggccaogca 21655 
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aaataacLac nona^^tatt netccccan^ ^^aacaccgt tega^tcgt ttc&stgc^fa 
GStcaagcita t^onnoacctEi tcjtattatta caai^Catta aaccctctcsc gat.LactcBt. J^l9s^ 
sggaatgtac gtaatactta tacaii^caiBC ttcacgtcst rraryoRflM II^^Uj^^^a 2201^ 
caaaaLaqac atcf^ctf^at atan^taatag saaBtcoccr. gaaA^L-Ag^l : Uv^jcjT|9cit:.t. 22^5 
taat^^tta occxiDs^kCc ^ta2Laa£taa tttstz-Ltcat t:att.litBg^ U^t^tt^^ 2313^5 
ntt<!S3cctaa t3w<i>g7W!t^^> ttacttsaarj CaaLtcctaa e:3at<Mat DggasArj .rtji 
iJiAiLL-L-:iiLil: ggttcnibgaL ^tULatx^cit y^^gaiftJitM gciK?an.tfltt tcact<$aggcci 222!^^ 
tuQ^tcggatgc girttts^at tcaaaaaggU Aal;gcx:l'.^.tH:- ^^igcwggrtoag oaganos^gpug] 22115 
cscttcatLc s^gtgtcatta aacctatcgc e^l-jacy^iai?iL t.l^^csUOTt ocaaocqatt 2257S 
agtaaaatca ^^qicaicagtta ^scaaagt.1.AA t^LU;ljggtnU tocnj!7t4^giic oattcagtga 22d^h 
BBtaaa^Ltg ccsrcactjgtt tjytRQRfl&gcr }^:lj%l.Calx^ tc^gccs^aic; Qcaatgtata 
tacaatjuktta tcoc^cTgttac tttaagatcc taB&ataljcvi Ul;i9!^bac^ caac9C3C!?ma 225SS 
agtetg«t««i gt:UTOTtcc!fi ctctioataat atCBtcgsttg tcaggataal" gttoogtoot 22€15 
cac:aL^«ce 9^l^t:^bi;:^'tt ccttcatt:aa agmgaaaat atHgtcfx^tl. ttcbtgccsia 22G7& 
atc^bcilsjcx; essaeaaiattst. tttLtatjgta aatatcacga te^tgataSK; gtt<3taccflc 227^5 
tctttttatc octgciogazuk ttttattaat atattcLcrs^ ^rtljgtucuga ti^xicagttt 22795 
^x)^gttb 3P^:t«?ib&t.tc cc^cttctaoci ataatgccaei tt^Lcctoi^ ctcgctoaat 22BG5 
tjCtctcaLsa cBcggpaAV^ ^tcctaateiL t5fcataccttc coctctiucag ijtjcsgtttgst 22915 
ccttEOTggt agtcgttcaa ai^sgbtg?™ oocracnocjt tfjtcococitt ciSLalia^l^t^t. a297S 
cctrkcttagzi EK3a£^Ertrg!^(^ taaiLEBc;aaA9 ^}9u»tur;gf7g Dctnnofstaa atgaizLcr.r.r. '^.iu:^'? 
tttcc^ctgac LccEfctaaaat etttxsflflj^^l <:Lt;txj?w«x«i nacojjrstca tacatjfjrilJ x; :i:s09^ 
ttattttziat tcattottcc £tCDQiaaalji grt w IjggTVft t<j ttciojcaaoaa aacanUfli^^ :J3n5^ 
ct^rOttbttt tct-btcmtt atttaacatfj sMgl-jfil;yua ttanct^jana ttgagLtt.l:i=i ;i:^7lfi 
U^fttclul^l Lttbanoact tatcaa^caa tttLcatrfll. l-Mt,l;^oan2V3i taaatacaac 2327s 
ccrt-Lcttat ^.Lafr^i^v^w tDCittfltaa^ii qaaatttcar: ^sC.alJAUj^i. (:^iiiQtt^aLaa 2323!j 
t^^^^cacBsL taaccar.t-lj* t^.tC^lflbQCDLt.t tcaatLggtt gacasagAf:!. i:at.tatxjttc 232^95 
»^3l.gtA2L. yi^i)£?Q:^i2itt ttaacatt^ staaBttaria r.Agr.i.u«.tiat tcettalDcac 234G!> 
t^3Cft3»u--ttcjc ttttttcgta tagtacatjca rjgr.t^utittt acx^tfrcrat ttcctgocafS iifjlf: 
at:ctcatt£^ D£0[^agtt giSArij^Lli.ljg c/i^i7Cqcttt Itttqcaaca gctattiLaa^ Z^svs 
c^%a^a>gt^ OQatagtgat taa^jcjtayny ttix*»gintr*t ntctqctctt taogcracaa^ 2i£i:3S 
ctjgcnrw«?ca ttcctati^ca tatctcagct atttcuudM^ wxit^ntct natcatctct 2i69S 
i^l.LCL^tg csaacggB&LiLi AiAx'tctoat tgsittcatcia ij^i>toMij^M MtAtg^atrft 2>75'> 
l.i/^flt.T-tact. Lt abg act Lta aBA gijyi Alrt zmt ai:q ^ysv. U"xj rr.-t- <jt^ 
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Met Il« Istra Gly II Asn Ptet Asn Eer Pro VftJ Ty^ 

9c^ at-ri 1^^. gar. gf^ tta asA j^li- rjiq l^C tbt oot t^it ctg aca 2:^rtS4 
GLu lie Pro Asp Val Jjeu lys lie Gin eye GLy She- Gift CVi? Le^u l^ir 
970 975 9B0 

g£iL &tn cac age tct ttr. Ann gsA li^t cac cac caa (jta tec gaa Xl^m 
ABp ILb Sot Stt Sgt Ph© Abo Giu a^e Mie GCli:i Glri y^J Scr Ghi 
9SS 9^ f3t95 1000 

cac etc tec t(?3 tec ijoa cac gac tts u^t. erst gnt gen ooa cajj 23550 
Wirt l^i 5er TKp ser gLij A1& Hi-s fi^sp £^ 'jyr his Asp Ala Gin aJrt 
1O05 IQIO lois 

goc caa aasi aat ogg ct^ taC gaa gog cgf. atl. cstt aim oqc ac^ 22&9a 
can Tyft Asp A-=;/i Ajig oyr Glu Ala Ang lie iws Are Ttar 
1D20 1025 1030 

aat cct caa tto caa 5»t tjct gta cat ctt at^ gte gc?? cct aat 24046 
AiiTi Pro Leu GlM Am Ala Va.l His Ltjii ALo ile val Ala Pra 
1035 1040 1045 

iTr^^L g&a ctci ata ggc LaU «iac caa ttt bqc ggc agg gey:: aet ccwi 2409^ 
Ala <:;lu Le« lie Glv 'lifr Aen Aro Gin Ehe fier Giy Aru Ala Ser GLn 
1O50 1055 1060 

tat gtD goQ vcq ggt ao= t^x: *its ttc tec ccc gcc get tot 24142 
Tyt/ ALs Pttt Gly Thr \fel Ser ser Ifet Eiie Sto; Pro Aia Ala Tyi 
lOtS 1O70 1075 i{)eO 

ttc act g3^g ctt t^it ogit ^ gca ogc e^it tta cac gcc afjc gat tci;. 24190 
i£u Ihr Glu L&a Tyar Ax^ Glu Ala itrg ash [jsu itia Ali^ ^^i^r Asd i*er 
1065 IC^D 1095 

Bl.L lAt t:tg gat act ogc ogc cca gjat crt^ aaA tra aCff Qcg etc 242in 
V&.1 'lyr Axu Leu Aep Thr Awf Airig Fti? A^ lieu L|yB Eer ttet aIa T^i 

1105 1110 

agt cas caa aat aii^ c*ca (jaa ctt tec act a\jr. Xnl tt?i tec aat 242B6 
ssa: Cln GCLn Aan Met A^ c^ij Leu Sex Thr bau Ser l£u Ser Af;tJ 
ills 1120 1125 

ij6g eta tta ttg g^i age att aaa act gag lx:U 5«»g ctg gnt aat tat 
Glu b&u Le^j Leu GQu Eer I Ufs Ttir Giu Ser lwb beu Afai ivr 
1130 113B 1140 

fict caa gtg atg gwA ^tc etc toe get ttc cgc ocii Ux; gcc ^roa aog 2d:^82 
•Xhr OlTi Vftl Met yClu >tet Le-J ast.^ /il^ pt^ Arg i^o iSer Gly AJa Hir 
1145 1150 1155 iieo 

Oct tac cac gat get V^ic ijM oa.t rytg cgt saa gt:i ^<Uj ozjq eta caa 
k>ro iVr HiB Asp Ala Tyr Glu Aan Stal Anj Lys v&l Ile GIjk Ljcu GLn 

niO 1175 

?»nt Oct CTtt t^ag caa tl-dt **aL- cy;t tco cca gcc att gpr. r-uj 2447 B 
Ai:i^ P/o Gly }^.u Gill OLn Leu Asa Ala ftor- Al^ Lie Ala Gly Ijeu 
11 BO 1LB5 1190 

atu csi.t tec eta tUj ggc- att- cicK? gct tea ate tea cot ^M^Z^ 

WE-t His Gin Ala Ser I^j Leu GLy lie Asn Ala Geir III^ Se*r Pro Olxi 
1 19"^ 1200 1205 

ttg ttt. artt «tt. ctu «3a gag gag att aie:t y^^i ggt aat i?;rt »q 9sa 2^574 
PiTe Aen ri& Leu Ohr Glu Glu He l^ir Glu Gly f^.tk AJ.r* Glu Glu 
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1210 



ctt t^t. w«3 AftA iaar. t.nr: 9191: aat hTjC. gas cy^ 0f:t. U;a Obg tjcc atg 2'1622 
Leu Tyr Dys Jjys Asn rtie dy jRsn lie uLu Pro Ala Ser Leu Ala MfiC. 
1225 12:^11 V^^^ J210 

Ijte: cTtt a^a figt. tjai. tjac aar. rj.ia a^c cjat gpn gna etc anc 24 670 
n-o GGLu 'ivr bs« Ara 'ivr 'lyr Aun Jjgu sex cSlu gId i^^ .Se^r* 
124B 1250 125!> 

caci ttt 3tt OTC aaa c^c age aac ttc ggr cbb. cba gtiA km ^^gt eat 2171S 
Oln Hx^ Do Gly T^.s Alw -Scr ton Ptso Gly (an GOn tau Tmt Geo: Asn 
1260 126S r2lo 



aac caa etc ate act ccjj ata gtc aac a^c aat gat aiggc ac^ gUr- 
ten Gin Leu T^e TTar Prrji Ilti Vhl ten Sbt ten tep Gly ite val ijv?b 
i275 1230 



gLa tit qgra att acc cigpc gaa tat aca era aai'- ftftt uoq gta ^;ac 
Val 'm teT3 Ilv TChr Arg lyr Thr 'i'hx Asn Ala Aaii Glri \tel Am? 
12&0 12$S i:5«ll 



gtgf eta ctt coc tac ggt ssg^ qiaa aat tat cag t.La swU tac naa 2^1662 
v^.l QlxA Tjeij Pte Etx> Tyr fij.y GLy filu Asn TVr Gin Leu Aen Tyr Lye 
1305 1310 131G 1320 

ttc doa gat Vx:l c^gt oeig gat gtc tec tat tta tec ate aaa tta e^u 24910 
PI:e uye Asp Sar Arg Gin %fel Ser tyr- Bex.- Ilu la^ Leu Am 
1325 1330 1335 

gnc ogn. ywj ctt ate cna att gaia aga gcg cct cag gtc ^wcs otc 24958 
Asp Jjye Arjj Glu Leu He Arg He du Gdy AJ» Pro Gin Vul Aan lie 
13dG 134 S 1350 

CM tat tea gaa cat atx: atiii SiUi rj^^ ajtva jsict gD.t ate a^ caa cel. J>S0OG 
OlTa Tyr a^x G-li^ Hift= He *1»]q: Leu Sex TTJir Ttr Asp lift a;ir CQn 1>eo 
1355 1360 1365 

ttt g>a ntc gee ct^ oca oc^ C3ta tat ecu tct agt. t^:l. tr?g sroa tD:t 2GOE>4 
F%ie Glu He Gly 1^ Thjf A±g WhI Tj-r Prr> -Sct 9er tfer T!rp Ala Tyj- 
1 1375 i3ilQ 

gca yrir- yc--? ttt occ Qtt qaj; tat BBC caa tii^r. tct ttc cto 2S102 
Ala Ala Ala Lye Ehe Ttvr lie Glu Gdu Tyr Assn Gin IVr Sei- Phe Lkij 
1^«5 1390 1395 1400 

iJji *r;Ur AiC. gcft att cct eta tct ogt gcg acH g^a?f tta tea 2S15a 
XiSfu Uys Leu Aen Lys Ala He Pirg l^i J5ftr- Atb Al?i Ihn Glu L^ ser 
14ns i<jiti 14 J 5 

rtsc: ecx: att Ctg gia^ «i3t att pta c^jt aj^ gtt aat caq caa ^bj gat 2519ft 
Pro rrta: He Leu Glu j;t-r He Vrtl Arg Ser Vbil Aen tsoii Gli:i A.';i^ 
1420 ld2fi L«0 



aLc aac gca qaa gta tt-i gj^l. gi.L ti.L net noa tat tat atg 
He Aeh Mrt Giu Visl Leu Gly Lya Vial Phe L^j Tlu P^/.s Tyr iVr Met 
li3S T4Jn 14^5 



?.s:?46 



csa ogt tat get att aaL t^± ^ n^-.K cjqc ctca ata eta t^c aat gca 
01 n Ang Tyr AJ*» 7 ly A-sn Ala Ulu llir Ala beu He Lifiij Cw: At;n Ala 
1450 M55 1460 



ctt att too o» r.igt:. tcfi tat gat aat caa C3ct- agu ttt gnt ctjc 
Uv r]ft ser GLn Arg Ser lyr Asi> toi Olri Vu> ser Gbi p!he Asp Arg 
1465 1470 1480 



25342 
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ctg ttt aat o€0 oca ttn cca mc ggc cwa t^iu tAl- U;\. 9g!A OAT. 3&^90 
leu PJn? ^Tir Pro Leu Leiu Aai dy Gdii Tyr Phe Ser Ttu- cay AEp 
1465 1490 149S 

^ gagg sLt niit tta ciat oca oijt act ijgjc qfDLt ta<2 ctit aQ£* tec 
GL\1 Glvj ri^t Jt-p lA^fti Tifrt P/ti GJ.y Set" 'Hw Uly Tr|V ArlJ ry!"; S^r 

15O0 15D5 15M 

gtg ctt aaa csgt- E3ca ttt Mt ixtc oat oat att toe etc too ccjc eta 2M86 
VaJ l#nj 4?B tog Ala Phe Asn lie Asp Aai^ rle Sei" ijfi-u nyi- Arg Leij 
151S 1520 :i52?i 

ctt. <^ aim ficrc aac cat aar. aai: <:raa ^ai- a\x. a^rt a^ii: 2f>5.34 

Lieui LyB ale Ibr Asn IJas As^ Asn Glrj Asp (dQy ivs lie Ivs Aan Aan 
1530 L5iO 

mv. aat ctt tct gat tta tat Ar.t gpgg aaa tt-a gcai gfta att 25582 
Leu Asn Asn Lew Asp Leu ivx lie <j1v Lys Ijeu Ljeu Ala Gdu He 
1.5dh 15S0 1555 1550 

cat caa tta aec att raat q&a. ttg gat tta ttg ctg gtt gwc gijg ^ 25^30 
Hit; Cli> ihr lie A*?p GJu Icu Asp Lbu Lew Leu tol Alo Vai <5ly 
1565 1570 

g^. 99t^ Qpift acT. afili l.^A get ate z&l ^lat Ma caa ctg gog cKra 2Sb7H 
Glu GGly Glu Ihr Asn Leo Ser A3b He Sesr As^ <Uri I^^^i Alia. Alzi 

15B5 1S90 

ct^ ate coa asfws etc oat aoc ntt ekjc sgtc tjfici eta -ca^ aca cag -2t^T2f> 
Um He Arg Lye Leia Am <l!hr lie TJir Val Tii Leu GOn Ohr Gin Jjys 
L595 16D0 

tag 4Cg ttcr £2aa thjji tbt gtt ofcs ^>Gt tec coc acjr tat aac aaa 25774 
Trp 3£x Ala Gin Leu Ifie val Met *r £er Itur f3er Ty^ Afjji J^y:--. 
I&IO i£ii?> 1620 

acij ctg aoca cct oaa att aa^ aat crtg ctjg ^»jU aiv. t-csc one ggt. 2SB22 
TTiL- Leu Tht- Pjfo Glu lie; T^>ft5 Asn Lew Lou Asp 'ihr Val IV^ HLb <3ly 
1^5 1S30 l6jtb 

tta caa qqc ttt oat aaa gac aag qcb &ftL l.l^ i^tij cat gbt etg gcr? 25B7& 
Lw airi Gly PlTift Lys Asp l^ys Ala Abu Leru Le^j His val Met Ala 
1645 IbiD 1655 

ccc tat att q^og E20C ace tta caa tta tea teg a^I <jtr- gcc cat 25918 
FVD lie Ala Ala fOir Tjfj^i Qiia Lcti Ser iter Glu Asn val Ala JULs 
1660 166S i67<) 

tct. yli? r^l^J C^iT. I^s 9ca goc ong tta nag ccc oqc gsc c^: ytji el^ JtS^fifi 
Qer val Leu Leu 'Srp Ala A£p Leni Ly.-; Pxtj oiy Assp Olv Ala Met 
1&75 16BD ISHh 

qcc cj:ia aaa ttc tgg g&c tgg ttg aAl: e^.i. utV ^wn oca ryst 26014 
•Hi)- Ala Glu lA/r. Wk: ixp Aarp 'i-rp i^u Asn llnr Gin lyr Thr Pio Afa^ 
1690 1695 1700 

tea teg gaa ota tta ^ A<rjrt tv»g <'^:c <»tt. gtf- carj t<nt tgt cof? 26062 
Sci Si.-.r OUi V<a5 Lcii Ala 'ihr Gin HiB Lie Wal GIj^ Tyr Cy^ GJm 
1705 1710 7 7} 5 1720 

ttQ Qcu GZtCt tC^i Qoa atg qtt tac cat tec aoc gyf. a^.i: «m1. gM» >6110 
ALa Lew Ala GLn LBu Glu I4ei: VjaI Tyr- Hit> £cr Ihr Cily ile Asn Giu 
1725 1730 iV^b 
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^ar. gcx tLc cgp dxi ttL ^aa £aaa cca atjQi ttt tog toes 
Asn Ala l*hs Arg JL^i P4>ft Vlai T*ir Lya: Pj:*^ Glp Met Ite Gly Sfer Ser 
n^O 1745 ryfto 



261SB 



RCK gca gta Oct gc3g cat gat gca ctt tea ctg fttc ate ctg aog 2S206 
Thr {31u Ala Val ALa EiLs Asp Ala LiSiu £^ Leu lie >tet T^u Oltaz- 
1755 1760 

ogL ttU gst tgg gtt aat ?>cgf tta ggc ga^ as^ gnc tct tec gta 2625d 
Arg Ite Ala Asp Tip Veal Assri Ala Leu Oly Glu LryB Ala Set A^r- Vol 
17-71) 1775' nso 



eta ^jog gca ttt <jsia oct 5«id eujt tta aog gca c^sa tt^ got oat 
Leu Alia FSie Gl\i Ala Abl^ Ser ijeu Thr Alya GUi Gbi beu Ala AEp 
l^SS 1790 1795 IBOO 



2^02 



goc atg aat ctt ?3at qct £aat ttg eta ttg caa gcc agt acU coa oca 26350 
Al« Wat Ae3i f^s Asp Ala Asm l^i Jji^ Leu Gin Ala Ser Thr (Slii Ala 
ia05 IBIO 1015 

cas aac oat- caa csab utt mc ocsa gtg 6*c!3 caa aaa aat ^^t; tic ccc 2E^dS 
Cib-j Aen HLe Gbi tiLs beu Pro Pro ^al Tto On lys Asn ALa Plie Ser 
18SQ 1B25 le^D 



t^t. t^ff ^ tct ate gac act atrr cts cto tgs gtt aat get gc^ caa 
cya Trp Ulir Sar lie Asp Thr 1*1© Ueu dn Trp Vbl Asr> Vfeal Ala Gli:* 

ISdO 16^5 



26448 



caa ttg aat gtc gpc oca cag gq&, tftl; tOc get ttg gto ggg ctg &*t 
GLn Lau Asn "Wal ALa Pro GLn Gly VaL Ser Ala Leu GOy Lea Aso 
ISSD lfi55 I860 



26494 



tat ffitt caa tta aat raa aaa ate ozc acc t&L ^'^c- cag t©3 ^^aa agt :2&54^ 
Tyr iLe <nrj i£u Aen dri 141^ He Pro Thr TVr Ala Gbn Ttp Giu Ser 
1&^& 1070 1675 ifieo 

gtrt gas ata tt^ ai^l, goc ^3 ttJj e^at tea caa cag cfir.t gat ata 26&90 
Ala Gly Glu lis teu Iftiof Ala Gly Leu Asn Ser Q3r> Gin ALa A^ rle 
1885 lego iR3?i 

tta cac get ttt ttg gsc ga^ U:t cyi.^ «gt gcc Qca tta ai^ aix: t«c 2G638 
ijau His Ala Ri5? Lsu pjsp Glu Ser Arg Bar Ala Aid Lou Ser Ttar OVr 

1905 ISIO 

tat ate ogt caa gt^ qcc ^shcug oca gcg oca gcc ata aaa cist oat 266&6 
I'yx He Aig <Sln Vtal Ala Lye Pra Ala Al-ft Ala He Ia^b Ser Arc A^u 
lSt5 1920 



gac ttg tac caa tan IJ^ eta c>tt ijat aat cag gtt tec gca ate 
Aep Lsu 'iVr Gin ivr Leu Leu lie Asp Asin Gin VrtI Ser Ala ALa lie 
193i) 193?> 19:J0 



26734 



asa act acc egg eL<: gcjj^ gcc att age att caa ci^ t»c trte 267*52 
Lys l^a: '3hr Am lie Ala Ala ll^^ Al^ Sex Il<a Gin Leu Tyi- vvsi 
13^?> 1950 ISSS I960 

OMT^ Cty gaa aat gta gaa gaa ear. gfx: C«L gnST gtt ate 26y;jn 

Aan Arg Tlir L^u c^u A:^i Vhl CSlu Glu aht- Ala Hie Ser GAy Veal He 
196S ] 570 1$75 



one 03t oac? ttc ctt ate qac tgg gas aaia Ivs^ t«i«- aoba coc Lac age 
ser Arg Gb:i 3?tie Ptae He Asp Trp tep lys iVr Aen LyE Aarg Tyr js^-r 

19aS 1930 



tii;pc; gag ggt gtt tct oaa Lta gtt Cac tac ocg AAct Ijjt- att 2(^926 
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l\yr Ttii A1& Gly Val Gin VaL Tyr lyr Pro Glu Aan Tyx lie 
1995 20O0 



qat ccc cwc? ntjg Cgl; ate gg?i ijw acx: Aaa at^g Btg gacr gca tLB r.t.<f 
SkXji Tfir Het Ai:q He Gly Gin llir ivs Met EHet Asp Ala Leu Leu 
201O 201 5 



c^*a toe gtc oyt^ of» wye ooa ttA ^*at 40SO ^ttt aOt ^bC gaw gMr' qtr.': 
C^ti f>ii Val Ser Gin Ser Gin Leu Aen Ala AB|> Hh: Val Glu Aep Ala 
202[^ 2030 20^0 



27022 



ttt €ttjQ tct K^t ots acn tcs ttt gwg c»«i. ^rtg gx*;!:. a^^b iwiw gt^- 
rt*- Met Sex TVr Usu ohr ser Plie Glu GLn vaL ALa Asn Leu val 
2045 2050 



27070 



;i.tt ^gC 9^Jg tat cac gat. aaL att aac aac ^t caa ggg ctg aoc tat 
He ser Ala IVr 1:^ Asp Asn lie Aan Aen Abp Gin OXy Lcii Thr Tyr 
20ea 2065 2D7Q 



2711 B 



aljc: ggicr etc age gaa acL gat acc ggt. gaa tiac tat tgg ogcr 
1^ lie <^y Leu £ter Glu I'hr Asp Ite Gly Glu 'Tyr lyr 'Itp Ax!j Sei- 
2Q'IS 2aE^ 



271 66 



gtc gst cac ai^ aaa ttc age gac ggt a^ ttc goc get aat gicie L^c^ 
'AU A;5i> Ri* SSfj;^ Fj^rtis Ster Ass? dy Lys Hia ALa Ala Asm Als 'lip 
2090 20&S 21-DD 



:i72i« 



Eict gna. tOT CTC acso att jsxt tjot oca att aat oct tac coa acjc act 
Ser Glu Tifp His lie Ttt^p Rfo Tie Aen Rrc> Tyr Axg j5ej" Thr 
2105 2110 2115 2120 



57262 



ato c>5t ci:^^. 4te 6»t;5 boD Ma toe coc ttc tat cts? etc tgn ttg gaa 
He As:g itx) val Ket Tyr lyK aer Arg Leu oyr Leu Trp l^j G3v 
:ifl25 2130 2135 



273ID 



ciaa aag gag ate act aa;^ ?ix3er ey« af^b aec 5W«» gat g^c tat cnix 
drj Iy:> CIvi Xl.<^ Thr li/s Oln "rm: <jIv Asn Ser Lys Asp GIy lyr dn 
2140 214E JlSU 



2735& 



ncc ^j9B ^MTSt- t^t cgt tat oan eta £4aa ttg gcg eat aitc tat 
l*ir Glu Thr Asp Arg Tyr Glu leu Lyfj I^jj Aia ai?> I.lt* Arg IVr 
2L5?v 2IS0 2165 



27^06 



get l-i^g oca ate act ttt. gat. gte aat gaa aaa aLa 

Asp Gly Thr Trp Aon T^u? T?rt> lie Tbr Stafc Asj^ Asn Glu Lys lie 
2170 2175 2ia0 



274 S4 



ccc r.L+i-, ga?* cto ^ca aaa aat asa gog Dct g^g -etc tat tgt get 
Ser D/B Leu C^u Leu Ala Uy?^ l.y?: Al^t? Pro Gly Tyr Cy& Alci 
2]I8^ 2190 2195 22(J0 



gcjl. caa ggL yw^ g^t- zseg ttij eta CTtt at?i ttt tc*t a£wr caa caa 
Gly I'yr Gin Oly <^Iu Asp thi' Leu Leu Val Met She Tyi A:riii Cin Gin 

2210 221 B 



27libO 



S^t am etc gett L^gt tat aaa ace get tea atg caa ggg eLa t^t atjir.' 
A'aU "iVir L£u Afli^ JE5t?i TVr Lvs atio: Ala Ser Met CJin Uly Leu lyr lie 
2220 :ia2.s :^23n 



ttt Qoc gat Qtff oaa tat aaa gat atg acc gat gga caa Lac aaa Txii\ 
Pl>e Ala Aep Met Glu Tyj- Tyr.; Asp Wcjt TTtt Asp Cly Oln TVr Sox 
2235 22^0 2245 



:^7Mfc 



egg gae aae age tat auA i^cw* ttx: gai. ^t^t wt agt. gtc cvga 
iVr Arg Arjv Aj^j fiftf TVr Lys Gin Asp Ihr Asn Ser V&l Arg A!:<t 



27694 
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2250 2255 2260 

gtg yAl- aac c^c tat E?cei nag nat cat gaa citt ccc tea tocj -sata oat 27712 
val ten Jten Arg Tva" ALa Glu qyi: Glu He Pro Eea: Ser Val J^n 
226S 2270 2275 2280 

^9t: fjgc aas sjoc tot oat t-swi QOa wat tat cat etc agt atg gta tat 2770C> 
Arn ly:? <>J-y Trp Gly Asp Tyt" Tv^J^ M;^. 'lyr 

2285 22S0 22&5 

aac gpa oat fltt- crjw tirj:!-. ^p^i.i.. d^igt uac- jwwi goc> cw:ri ten isgt OQt tta 2783 b 
Oly Aep He SPro Ito He aer Tvr Lva Ala Tlar Ser Ser Asp Lf^u 
:^3co ;iSob 2510 

«£wi tat ate tc^T cca tta ajggs att act cat aat LaL gtJM 37690 
lys xle iVr He Se-r Prf> l^o T/su Ar.^ lie lie His ten Oly 'lyr tslu 

2320 2325 

gro <_;^g caa age aat caa tgc aat eta atg aat a^a t&t gg«: etAA (?tft 2793-1 
Gly gOtj tJin Arg Asn OIej Asn Lcii Msst ten Lys lyr ULy L^/IE £isu 
2330 233& 

sgU ^t aaa tct att Ejtt tat act aac ttg gga gtt £ac oca aaI- 27952 
GLv Asp Ly!> Pte lie VaI T^r Tfrnr fiter.' I^ji Oiy \feii Asn Cro Aan Asn 
2350 2255 2360 

teg too aat ong rt<T «tu ttt tac ccc get tat caa tat aac gga aat 
Sgt Ser Asm Lefu Met Tyr Pro Vbl ly^; GUn IVr ftsa <£Ly Ann 
2365 2370 237i> 

gtc agt ggg ctt e>gt caa (jgc oga tta eta ttc cbc egt ace >iP078 
S^- Gly Leu Sar Gin GCly Arg Lfifu ije^j Ehe Bis Acg Asrp ihi: ten 
2380 2385 2390 

tat too vxl. aaa glia gssii yf^u dtt ^x::t- gga oca Q^a cgt tct eta 28126 
lyr Ser Ser Uys v&l Glu Ala Trp He Pro Gly Ala Gly Kr-g Ffjr 
23¥!> 2400 2405 

eicii A^t cogi aat gpn gcc att ggt Cist gat i'M. gCl- giric tcrr tta 2B171 
Tta" Asn A'JTi AliA Ala IId GQy Asp 'jyr Ala Thr Asp Ser I^i 

2aio 2dis ';>4:;iu 

tiiMS aaa ocg aat gat ctt aag caa tar gftr: t.«t atg net oi*c agt aaa 26222 
ten Lys Pro Asn Aep Lje^u Ift^E tatn 'iyr >tal lyr Use Tluf Af-p Tyss?; 
:!4i!i 2d3D 2JJ5 24d6 

ggt act get acc gat gtc tea cca gta gac *»tt? act oca att 28270 
Oly TJ"- AlA Tttt- /i^si^ Vj33 Sk'.r Oly Prxi %il Aap He Aen Ttu" Al^ lie 
2445 2450 ?A^5 

tec cog m«i gtt. cag eta «ja cfta aaa gcc ggt age aaa cw^ t-j^ 2S3L6 
Scyc- pJY> Ala liyB Val Glri T^ir V^l. Dyr.; Ala dy sier Lys Glu Gin 

ar.i^ ut acc gcg gat /S^at- gi;c l-x:^; aiL c-aiy coa tec cct age ttt 2B366 
Ttior Phe itir Ala Aep pys Aen v&l Sei- He Ghi Pr<rj <5?r- pre Ser Phc 
2475 2480 24S5 

^t cTOi^ <stJj caat tat cca ttt oiat fict etc gaa ata gat ggf: i-i-ii Af^l. :4H414 
A'.?i> olu Met Asii nyi- GLn r^ie ajj^ aIa Iah my Jii* a^ Oly ss&r ESei" 
2490 2495 2500 

cto <i«i. ttt act aac aat tea gcc agL a^t. g»i; iitt. ccc ttt acc epcq 28462 
L£i> Asm 1^ 'Ita- Asn ten Scr Ain Sot He Asp He Thi* F^e Thi' Ala 
2505 251& 251t 2520 
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ttt 033* gutj <ff^t. gga cgt aaa ctg sjgt tat gaoa i3»gt ttc c*gt att oct 
pr«i Ala Glu Asp Qlv Ara Ly« Iatu <5l.y Tyr GLu Ser E^ie S«r ll& Pro 
LJJ^S 2535 



att acc cgc Db»cj sbg lauL ai::l: e^^f. aat. L<x: ctg acc ctg cgc cae aat 
ile CFtir Arg Lys Val Ser Thr Aep Aon iSer Leu "itu: Leu Arc His Asri 
2540 ^^^^ 



2&B5B 



aat acg I'^l- Atg <i:aa tgg ^fga grtc tat atL cgt ctt 
m.u A3=sn Giy Ala Gin Tj>t Wet tsClri 'ftp Oly VaL TVr Ar^j I)<r. Ari^ 
255S ^5£D 2S65 



«wt tt3 ttt get ocjc caa tta cjtt ocg oaa <acc xt occ [jgt att 
Abtj Thx beii Kk; Alvia Arg Q}U 3>=:u val JMa Airg Ala Tlix Thr GLy lie 
?.5in 2575 2580 



43r;y Att ctg o^t atg ^»aa act cag aat att cs^a gaa ccq cag tta 
Jtisp Tta: lie Lc« sGx tfct- CQu TVir Gli^ 7*$!^ Jl&. <^hn <5iiJ Pr^ Giri 
25r5F. 2590 2595 260O 



2£r7D2 



e^gcr &aa ggt ttc toe ijct aog ttc Bta ata oct c?cir tat csac ooo tra 
CaLv l^y*:? 0]y Fte /ila 1^ Phe Xle E^ro Pro lyr Aetx Pro 9^ 
2605 2610 26-15 



2S7K) 



act mt 9gt Q^e Ltt aag ct.l: Car. ate aaa cat gtc yi.U 

olm^ liis Gly Aap I2lu Ar^ Trp k^s Leu 'JVr He V<;a Val 

:i£20 2625 2630 



9^1^ AAt aat tcs cat att ate tat tea ggt cag eta aaa ^t aca aat 
Aap Afln Psn Kis Il« 11<3 TiT Sex Gly Obi Leu I^-s Asi> Ttir ten 

2640 2645 



ata age acc sbog tea ttt ate cct ctt gat nat titt cca ttg aac caB 
llic S^' Hir frtuf Leu Ph^ T.lez Pm U%i Aro Asp Prts l^ia Aftn <1|n 
2650 2655 * 2660 



28d^l 



C^.t tnc «g^^ gw gtt tac atg noc ttc Tiag ziaa tea txv% C-cfi 
;>j=.p Ser Ala Uys val lyr Itet Thr thie Lye Lye Ser Pro Ser as^j 
2605 7:^70 7^-7^ 2&B0 



28942 



gfjt ti^ gkjf; i>ct OH*? tbt 4tt <i»gw. i^t- 9«t tvsvm ©aei ttt« gta 
Gly Ihr Tirp Itp Uly Pro Hlb fte Vkl Arq Aep Asp lye Gly lie Val 
:^ee5 2630 2^55 



2BBS0 



una eita C^:K- c^oa tec «&t ttg ajcc cnc Ctt gous enrjc gtc otat 
ito Ile Aan Pro l^ Eer lie Leu ^Ito [ILe Pbe Glu Get val Asn VA^ 
?70O 2705 2710 



29018 



ctg Dixt ^ua.t i^M. ac?t Ei»Q oca .2tn c>at ttc- cage ggc gr.^t anc c^c 
L£ra -»-sn Asn lie &sr Ser Glu Pro tiet ^jsp Pirie- Ser Gly Ala Aet^ Ser 
2715 2720 2725 



2^086 



cte tQt ttt tf?y ct^T ttc tric tat eioc ccri citrr ctg gtt nee caa 
U^ii Tyy Phe Tr^ Glu l^i^ Ptie Tyr lyr Tlir' Pit^ Taj V^) CI" 
2730 2735 27^0 



29134 



cgt ttg ttq c^t caa aae ttt gat gaa ge^ aac cgc ctg aaa 
ArT3 li^i T^ij His Cln Asn ftK? Asp Glu Ala Asn Ars Ttp LcTU lys 
2?45 2750 27S5 2?6P 



tat iitc tgrj ogc ccq toe tat £att tjtt cek: (soc caa att casi aat 
TV^c Val Trp Der Pro Ser Gly Tyr lie VQl SJts Gly Glyi lie Rlj^ A5ir> 
27« 2770 2775 



2^30 
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t£C caa iDgg aac ate cqc ccc Cta ttQ ijBik trat ncc aicjt tjjg aac agt 
TVr Gin Ttp Vb±J. Jurg Pro l£!0 Leu GLia Tlhi* J3fer fpjc^j 9F?r 
27 BO 27 E5 27SD 



2927ft 



^at cct ttfj oat toe ate sjtxt, cx:c qqc oog ctn Qog cqc cog 
A.')fa Prt> Aep Ser VaL tep Pao Aep Ala V&l Ala GIj^ Hi.?; Al^l? Pro 
2755 2 BOO 2B0S 



2032^ 



at^ cnc tat ejwi gt-t- tea ozo ttt ate coc «*oc ctt qat ctn ttg ate 
Met Jlis Tyar Ijys V&l Sen- Ttir Elie Met Angi Tl-u: Leu L£ij lifeii j"li^ 
2810 2B15 2B20 



29a7d 



^•y^ Cijc tr-^- ™t get tac cgc: csm ti:^ g^^t «cjg ctt md 

Ala Ars3 GLv Asp Alis IVr Aca Gin Leu CSlu Al-q Jtep Ttn- L€u Aed.^ 



2^22 



gaa gog aag aitg tjgg tat atg £2aa gag ctjg oat. crtg tta ^nepci gnt 
Glu ALa Mst *fl-p 'JVx Met OLn ALa Leu KIb Leu Leu Tiap L^: 

aesn 2855 



29470 



Cirt tjat crtQ <2cg cbg agt fitoc= l:£(yf ^leit cx*i cga ctg ^jac 
Pro 'IVr Lew Leu 3sr l^ur itir Tup Am Aep Pro Arg Leu J.yqj 

2B70 



29516 



gcc gcg att acn acc caa agt get cat tea agn Lea AUa st<? cct 
Alis 7aa ALP lift Tt^ 'Jt^ Gdn Ster AJi» Kis Sfer Sser Ser lie V&l Ala 
2E75 2380 



29SS€ 



ttg cag act ooa cog gcsg ctt tta tjca ttg £?gc age grcr 
T£u Jirg C3l3 J Ser pr^o Ala Leu Lau fler Leu Arg 3er Ala Abo Ttir 
2890 28;55 



29G14 



etc *:?c ^t etc ttc eta cxzg caa ate aat gtg atjg at^ aet twc 
l£u Thr A^ Leu Pti£f Leu Pi^ Olt^ Jio Asn Glu Veal itet Met Asn Tyr 
2905 2910 2915 i92fl 



296<^2 



tgc! caa ACS tta cpni c«g ogn gta toe aac ctg osjc eac aac etc tct 
iTp OlTi Ttcr Lgu Ala (iln Arg v&l Tyr Aen Leu Arg ills Am Letj ftor 
292S ?.330 2»5 



2^711) 



ntc gwD ggt co^3 doq tta tat ctg oca atjc Cau gcxt ^ca <30g cj^j ©sic 
iLe A^ Gly GLo P/.vi fj™ TVr Lau i^o ile 'lyr Ala Otnr Firo Ala Acp 
2940 244?i 2950 



2S76& 



003 ais^fc gcg tta cte age ^ gut gci:; act tcl w»« ggt- gga ogc 
Pro V^'S Ala leu Ijeu Sex Ala Ala VbQ Ala 'Ite Ser Gin Gly Gly Gly 

2955 2S60 3g^f> 



29606 



<viy i,z[,^ i;C5 tjt*g tcfl ttt atg tcc Ctg Lgg ogt ttc cxjg caci A^,u etc 
U^: leu Pno Giu Ser Pitit mi IjC« Ttp Ar3 ilx^ Kis Wet 
^!^n 2975 2&eQ 



29a&1 



e*at [y;t coe age at? gtt a/c^c cag ctn aca cm l.Lc tn^;: Ux; noq 
Olo Asi-j Ala Airt^ 8*ru- »ct Viil £ter <;1ji Lau ^ha: Gin Fh& Gly G&r 
29BS 2990 2995 3000 



29902 



tta caa aat att ate txjt uc»g o»c cpooi qna gcr? etc oat neg tta 
LcTU Oln Pi^i IIg lie (XLu Arg Gin Asp Ala Glu Ala 1^ A«ai Al^ ]<tu 



3010 



tta caa aat ca^ goc gca c^fsg cl;g I Lg wwt; <;t5 ivjt- om 
T^j Asji Gbi Ala <j1u Leu lie Leu ^hi: Asn Leu Ser ll^ Gin 
3020 ^t}2b 303D 



2$99& 



oac geia gasi ctg gat gcc gag aaa acc gtg ccg gaa &aa ^DOdh 
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fi£a 1MB l^MT Xle Giiu Glu lieu Asp Ala Glu La^ Ihr VSil ijem Glu 1^ 
^Q3S IfOdD :^C4^ 

Co:; guQ ^fft gc^ trg ttt geat riigc tAt. a^a utg c»t 3009^ 
£er Uys Aia c^ly Ai£> <iln tier Arq ]Wie /isip Ser lyr Ser Lys Leu iJiJS 
305<? :*a5f> 3060 

g?*t gjsrfi at?it^ ^tjz ft£^c ggl- ^aa ycf. »bg D<?a eta cc»a ^jog 3Dld2 

tep GLu Aso lie Asn Ala <5Lv ^Slu Am Gin Ala Met 'Ufa: Lrtu Axy ALea 
1065 3a/U 3Cr75 3060 

rxx: Crca ^fioc ggg ctt sec aeg gcg gtt. cag gcia r^:i; csgt- cto Qcc one ]S01i>0 
£tei- Ala Al/a Oly TTir Thr Al5i VsQ Gin Ala iJer Ang^ Leo Ala Gly 
30B5 2090 .^095 

qra Cjct oat cts «n cct aac ate ttc ggc ttc gcc ggt ggt ggt 3D23B 
i33 Aiw Ala AEp L£iU val Rn^ AW PfrK! CQy Ite Ala tsly Gdy dy 
3100 310& :^13 1) 

cicc tgij ggg get ate ijct sssa Esoa acc sygc tat gta atg erwia ttt 3Q2B6 
fit-t^ Arg Ttp dy aIb He ALa GLu ALa T3u- Oly lyr Vfel Hat Glu Plie 
311& 3120 3125 



to^: 5^r;i. eat gtt atg lisL ^og 9«t aaa att aj^ eaa tct 

S€Q- Ala >^ V^l >la& Aan rjir G^u Ala Aap Xle Sar <3]n &er Olu 
3130 31jMi 3140 



30334 



aoc tsc ogt 03c ogt gag tgg g^a 003 dat aat 
Tnr Tb<T Atg Arg Ara Arn Gin tilu '1^ Glu lie Gin Artg Ai&it Abr Ala 
*2LiS 1L55 M.f^U 



gaa g^rg ctg aaa caa <:;(^ gaU goe ctt oaa teg ctg g:=a glvs 
Glu Ala Clu Leu Gin Leu Asp Ala Gin lys Gcr Lou Ala V^I 

3nO 



3043 D 



cgc cart Qxi 000 qec gta ctg caa aaa acc a^; c^tc eina ace caa caa 
Angi Tirg Glu Ala A1& v^i Oln I^ys Tiir £^ Leu Lys rAvt* nin Oln 
31B0 31B5 3190 



3U47H 



<37^g UTC cjaa tta rare ttc etg caa cgt: ^l^c aat 

Glu Gin Glin Ala Glxk Letj Ala T/Ai Oln Arg 1^ Flie Skl- /^^i 
3195 3200 3205 



30526 



os;? gK>g ttg tec a-ac toa eta cgt ggc oga ctg gt-^a gos att tac ttc: 
•Gin Ala l^Li T^p: Asn Tr£> I^eu Aih ^Jly Arg Leu Ala Als Il<r 'JVr Itie 
32L0 3215 :3220 



iA"v?. i l^c i A:: t^^:; tt^ Qct ate Qc?5 cgt tgt tta &ijg rjsq caa get 
Gin FhB TvT A^ Leu Ala Ll& Ala Ai^ qy!> Lou t-tet Ala Gbj Gbi fcic* 
3:£2S 3230 323& 32>10 



tficr oyt tgy ar.l- .^»gr. cjiat gp.c tct qct cgc ttt am, ewisji uog E?3e 
lyr Ara 'l"rp Glu lie Ser Ae^-j* f5fer Aiat Aru He Lye Pr<r> Gly 
:v^A^ 3250 3255 



30670 



gee t?iCT oaa ooa acc cat gca ggr. crtg (Xly <jca a?Tt gaa ace ttg atg 
kla Tip Gin Gly Tlir 'JVr Ala Gly Usa Jjbu aIa Gly Glu 'nir Leu Mat 
3260 32<i5 :i270 



inn M 



eta agt ttg gca ^raa aty s^3^^ ^ac- i3c?c cat tta aga cgi^ gm. ^l^jcjc; 
Lgu St^ Llx> AJjft Oln i4?t Olu ALa Ui£ L&ki A/.-^ Artj Asp Lvb Ar<T 
327S 32 no 32as 



30"7G6 



?Tca ttc* 955 glit: 9^ rjta ten ctg ocx: ga^ iitt t^it 

Ala Leu Glu v&l Olu Arg 'l*ir Val Ser Leu Ala Cl^* lie lyr Ala Giy 



3061^ 
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:&2dti 32db yjt^u 

tta ccg oaa gat iinn ggc cca ttc tuoc ctg aog caa gsa aLci grty ^ay 30862 
IjGf*.? Pto i^JjL A«Cir> 53yft Gly Pro Flic S«r Ujii "nir LiLn GLu He C^lu Ciys 
n^)^ 32Lh 3320 

g^J^ Ad:c gca ggft tc« 991: Agn y^-r gg^. rt«t- ««t nut ttq 3DS1D 
Leu vaJ AEn Ala Oly Ber Gly iter Ala Gly Ser Gly Asn AHn Ass^ 5«!u 

I3^n 3335 

gc3 ttt ggc acg gac* acrL aaa ac<-. Lcl. Ujj c^Hy 5r:2i tec att 3&&&fs 

Alrt Oly Al^-rt fily T^ir Aep Tlir Lys 1^ Ser Leu Gin Ala Ser iJrf^ 
3340 iidS :u5n 

ten ttcfc gcit oat tta ni%E» att cgt gaigg ^t tac cogi ^aa tct. gg?? 31D06 
Sftr Leu Ala AEp Lea Ly/i JJ*- Ary CQju Asp TVr Pro <au ber lie Gly 
3355 336D ^i36?> 

£Km acc CDC! cgic: eitc c«*a cag ate agfz gtt aoc ctjg crjg ^cy eta ttp 310Bd 
lie JSrci Arg lie; i^f: fl]Ti TJ.<^ Scr AftiL Ttor L3u l^-o jQa Leu Lei^ 
^370 3375 33 BD 

□g?t oct bAi^ C5^:3 yy^ gtfj cog CK:a aLa tta tct tac gf^^. c^L *wan gcc 3110(2 
Gly F-ro Tyr -uLn Asp V&l Gin /Ufl ri»^ Tyr Gly Asf^ Ala 

HilRfi 3395 MW 

goa tta g^^g a^c gfc*r: gc« gct3 eta Qcc qtt tec cac 9^ ac^ 311S0 
Oly Leti Ala Asn Oly Ala Leu Ala V^'l Ser Hiii Cly 'Jti Aso 

3405 3410 Silb 

gac qgt CBB. ttc CU^ g«t ttc «eic ggc ttc ctg i^tsg 31I5e 
Af^ air Gly Gin <:in Leu Asp rbe Aen ABp Gly Hic Leu Pra 
3420 3<i2«> 3130 

i:L-.f. ggl: atn ycm? ^itt sjat caa ^ aog ctis ^icl^i cty e*gt ttt cct 3li46 
GIu Oly ll© Ala lie JL-si) Gin ^lly TSir Leu 1%a: Leu Ser Plwi Pti^ 
3^35 3^40 

aat gca tee aog cca oan gsjt caa gcc act atg ijji <%fm acc 3129-3 
Ayri Al<a Sor 'T^v: Pro Ala lye Gly lyis Glii Al^ Thr M(?t beu Lvb Thr 
3450 3455 3^S6D 

ctg aac gat ate att czit att ojic tac acc att a^g ta* 3L336 
LeLi Asn Asp IL<?» lie Leu His; lit- Anj TVi* Thr ILe U/b 

3^tB :M70 

ccatcccasc acagaactfstt i^apcoodccoc qaatcgjgsggt ct</gtEaftgi5« ytttct atq 3139S 

c^D ncrt- tea CQD cura ttc sl^: «iy kied:.: etc ten tta cct aacj gyt: ^1^43 
GLri Aa-j Eer Gin Thr P>w ^er Met Ihr C51li Lcaj Eer Leu Pr^i Lys Oly 
3^60 3^B5 S^SO id*S 

TO- aiju <jl:c. cstt occ ggt alxj yjgu yno gc-ft tta acc ccg gcc giyg 31451 
fily Gly 7ila lift Hir Civ Met Gly Glu Ma frtu Tt>r ptt? Alo Gly Kro 
3500 3505 ^510 

<3at nst <itg qca gcc toci ctq oca tto coc att tct (^x,: gg« cRt 31&3!? 
A-.q^ Gly W^. Alw Al^ Leu iSer Ijku Pr<i r*3La 11© ifer Ala Gly Arg 
i515 3520 .^y^fi 

C>3t t-cit gcc ccc tog etc Afxj lU-y ?ie*c t-fv:: iiric a^yc qga aoc yyl: Art<: 3"l5Eir7 
Gly Tyr Ala Pro Ser l>ju "I'hr Leu Asm Tvr Asji Sfci Oly Tlir Oly Asm 
3530 isas 3540 
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OTC cog etc got etc ggt Cgg ©ac tgr. aau etc atg noa att cgt cqt iL635 
Py'T Pro Gly Leu C?Jy Trp Aei;> <5ys Asn Xtel Met -Thr lift Aitj tog 
5[>15 3550 3555 

ca=^ ooc agt e>cc rjgc gtj? ccg sat tat gal: ^ acxj gat act ttt ctg 
Anji wr asfc ol'ir Gly Vul. Pr^ Asn 'JVr Asp Glu Thx rttir Krj ijexi 
2:^60 ^570 

a>W o£7n gut qod. qtg ttg gtc gtA ttn iKit s?ca ggt cmp^s :4173i 
CUy Pxo Glu dy Giu VfeJ VoL Vtjtl XLa Leu Aca Glu Al^a Oly -UlJi 
J580 35^D 

<jc?t. 9^it ta^ rtgt-. gc*a tec tea tts cag ^91- ^ix ««t ttg atq 31779 
Ala Aep iLe Area iSer Glu Sear l*rvi Cln Oly lie Asn Leu Gly JA^l 

3M0 3^>(?5 

acc tec a.=c gtt ace ©gt nat toe ocjt Ctg gaa age carr ttt aac 31827 
Tbr rte: Hir Vc^l T!hr <jLv 'XV^ Ai:g Ser Arrtj Leu Glu Bsr Hie Phe SSfti- 
3et0 3615 36:40 

cssg ttg gaa tac t^ cis«3i ccx? aica aca ggc gc^ 4^ gat ttc tgcr :il^5 
An3 L©U! Glu Tyr Trp Gin Pro GLn Thr- ihr dy Ala itir Aep Phi? IVP 
^62S 3630 aSSS 

ctg ata tac aoc ocxr gac gtj-c^ csfl«i gcc oat tta ctg ggcr $,-10 osat ccz 311*23 
be-a Il« Tyr Sex Pa? Agp yly Gin Ala Hlf; ljt?x? Lnji Gli' ae^ Fa^^ 
3&40 3ed5 3£50 3655 

dt^n gon cgc ^Ito ?agc oat cos eta aat 9tt a«c can aca gcg ca3. bgu 31971 
GLri ALs Arg lie Ser Af^i Pro Levi Asn val Asn Oin Tla: Ala Gin »rrp 
3660 3665 3670 

ct^rt tt^ gciz tec at£i tea tec <jgc (jag cag att tat lat caiij 32D1& 
Leu Leu cn.y Ala Gea: V&i fcr Sht His Gly GLu Oln IDjt ly^ (Sli:i 
3675 leas 

Lat oc^^ goc gafi crwU g^sa net gat t^jc <^t sec gan etc aca gtn; 32067 
Tyr Aiv Al<ji Glu Asp Glu Thr A,sp (Slu "Ihr Asp Glu l^u Ala 
3630 369S :?700 

cac ocg OjSc oca ace gtc cc*^ cgc tec ctg caa gta g\'A cat tac ggt 2513!? 
Hie PiD Ai;ri Thr TTir Val GLn Arg Tyr U-xk Gin VtoL val liia -syr <31y 
3705 3710 :57]5 

^at eta acc 5SOC age g&a gta ttt 00c aces ct^a aaI <jnt gat cca ?.?!ie^3 
Aa^ Lsu Thr Ala Olu \tel Ptae Pita TChr Leu Asn Gly Aep pro 
372S i730 i73S 

I lilt .^^H \jCX ^ tr>a ttg ttc 1:^1. t.t^fl gt^a ttt c*st tac ggL cr^ iiiii 
lus-j LA'S 5er Gly Tcp Leu Fh© Le-u VaL Pte A;fp TVr GQy Glu ?ixg 
3740 :i-?45 :^750 

□iKi oae «5ic tta tct c^a ^Ce tog oc<i ttt aaa gcc aca aql tj^g 322 bs 
liyi^ ^1 leu S^>r Glii Me-t lire. P>n> Phc Lys Ala "l^tr ser /kfm Trp 

3760 37 &5 

cAX. tr^ cgt: aaa mw,^ osjt ttt tccr C9I. I^c ama t2iD ijgt ttt gcs ttg y?:M)y 
Lea L^/B Arv lys ABp A/i? Fte* irter Arrj Tyr Glu 'Syr Gly P)v Aloi Leu 

3-??^ 37 BO 

cgc aec egg cgc i.Ijj cspr caa ata ctg aio^ i,ut new cfjt ctg caa 323 
Arrj Tiir Arg Ain Leu Cys Arg GJx> lie Leu Wet felie His Arg c^i Glu 
37 e& M^o i79b 

Licc ctg tct ggt rag gca ©gi: gpnc geit goDt ccc q^ea tta gl.i. 1:^403 
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Thr Ijeu Ser Glv <jln Al« La^i Rly Jtefi Asp gQLu Pro Aia Leu V&l Sfer 
3eOO 3310 38L5 

ogt tit^ atA ctg gat tat ^fuc qur anc oog ^jtg gtc 5>gt. okjq etc cjtt 
Arg Leu lie Leu Ajbip TVt VaI Val ^er -rtir L«SrtJ Val 

UfiSQ 3825 3830 



32451 



tc^t gtc lixj^ cga gtg gga est ^ caa csat ggc aca acg gcg gtc gcc 
ser l.(dl Acica Ace Val Gly Bis <:iv <?Ui As^P ciy Hit Ihr Ala Val Ala 
3S3S 3B10 SSdS 



ctg cingi oca ttg m^Z4 etc ^ tat cao cct ttt qodl cc3 pais caa aaa 
Lot Prc> F^i.> Leu Glu Leu Ala Tyr dn Pro Jl» Glii Pic alu Gdrj Ly: 
1650 3855 3BG0 



325d7 



gca etc tau 03a rca Ktg ^i-a cUj gug awl tte osic «c?c ate c»a 
Ala La^ Trp Arg RfD ttet As^p val Leu Ala Asa Pte Aei^ Thr lift <3li> 
3B65 3870 3875 

cgc tgg C3aa ots ctt gut cbg oaa ggc 03c eta ccc asjt att ctg 
Ar^ Gbi l£u Xjsu Aap Leu Gin Gly Glu Gly val Pm Gly lie T<a3 
3BB0 3885 3690 3B9S 

tat rae 9^1: assa aat gig?r: tgg tgg Lar. cga td; ^irU msi cgt- cae aro 
Tyr GOn ABP I^ys Asn Gly 'rEp -Prp TVr Arc £JBr Ala Gin Am gCltj Tl:ff 
3^00 

^t G03 5jtc occ toa aM atg caa etc ctt crt 
GDy GLu GlLi Cfet Ami Ala Val Tto Trp Gly Ly-: M^l-. GOn L^u Lsu Ptro 
391S 3S20 

ate ang ozc get att coj? oat aac »dc tea ctg atg gat att aat ggt 
-Tie 54fcr Pro Ala lie Gltt Asp Asn Ala fl&r Tjejj ^ Rsp n© ^ cay 
3530 3935 3540 

oat ggcj can CUg ^^b gtt ate am gryt ctio 905 eta ara; ggt Cat 
Aep Gly Gin Leu Aap n-p vaL lie Thr tSly Pro Gly l£u Ai^ GJy TVr- 
594^> 39f>D 3955 

<^cr age cag cat oca gsc fi^c agt tgg aiCc* i?gt WiL v^i.^ ^xx? t-ti^ oc«? 
Hi© £ter GOn His Pro Aop Oly Ser 1^ u^ir Ar^ itie iti* Pto Leta 
39W 



3970 



3375 



gt^f Utfl ccig ata gaA <:at: Ouc.- c^r^ c^^c ctt ©oa gat tta 

Ala Leu PCD lie isiu nyr l^a: HiB Pro Arg Ala GLn 1^ ALa feq> Leu 
^fl^SO J985 3S90 

jitg cigc, CKZc q^j ctg try: v.t.^ ^t-Q Ct^ att ^?3t. coc aaa age gtg 
Wet Ala Gly Lfu Ser Asi> Lesj Vtil Leu lie Glv Pro Lwe Sfer Vm T 

t:gt ttQj tat ast aa^: cy^t. g^L ggr. ttl". ^X: -i^tja ygw txS3 oat ?Ttg 
.*jr?] ie-u T^rr Ala Asm Aan Arq Asp Gly PS-je Ttar Glu Gly Arcj Asp Vb 1 
A016 ^015 4020 

gljg i-j^ tccf ggt. ggt. ^Lx «cr; cUg fjog tta ocag mc <^t am cot 
val GLn »er Gly Oly rl& Thr Leu Pro L^u Pro <5ly Ala A™ Alo Ara 

403O -1035 



32595 



32643 



32t91 



32B3b 



32863 



22931 



32979 



33D27 



3-3l)V.S 



aa^ tea gtg goc ttt ajjc aae tita cte ggt tea gcjc caa goa oak n^l^j 

L£u Val Ala Ptoe Sea: Asp V^J rw Oly ."^t Gly cin Ala Kis Leu 
•^040 W5 4050 <3Ci5& 

c^tt y»ia gtt agt s^eg aea aaa gLc: ycc t^gs cca ojkt ct^ 03a cat 33175 
vsl olu Val Ala 'rtu- Lye v«i -ihr cys Trp Pro Asn Leu Gly Hi© 
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cjgt ttL ptTt cag coa ejtc ^3i;v* 99^ ttt age caa tec oe?c 

Gly Arn Oiy <3LrL Pro lie ito beu ITlo GLy Itie £ter COj) S^r AJa 
d07G ^OliU 4086 



33219 



gtx: ^iiJi. t.rt aat cct qat ooa ott oait cts gcv- <j«l. r;ljy gtir. ggt Bgt. 
Als tert lt*i?. A:;ri P/.tt A5:p Arg u=Q ]]ie Lieu Ala Asp beu Asp Glv Bbt 



g^L cct qcc <7Rt ctg att t^t gM. t^ftl- ^r:!. gss^f: cet ctg csst att ttc i33i5 
Gly JF^i> Ala tep Leu lie lyr \Jtil «ia Ala Bi& Leu Asq? Jli^ Mi« 
dlO:» 411.0 dll5 

age Mt Gcic? Qgv. aac ggt ttt ^jca cea oca ttc aca etc ogt ttc 
S^' Giu Set Uly Aan Olv ALa Glii Pro R*l^ "Jtr Tjeij Arg FSift 
dl20 4i:>s 4130 4135 

oct r?5C our^ cttj liijl-. Slit ^t crat acn tgc cag eta gtg get gat i5411 
A^jf> -Oly uefu Arci Phe Asp Asp 'ito: Oys Oln b3u uLn Vrsl Ala Asp 
^141) ii4b 4150 

Ota cas gr^ ttft gf^ gtt ggtc age ctg ate ctg age gta ccg cat abq 334 &9 
vai Gin Glv ijeu Gly Vol Vtal .5c?r I^^^i. T)u J;*?r ViiL Pj:i> Ha^i f^^x 
41ii& 4160 4165 



gi.:ti cx:& cae cat tgg cge tgc gat ctg eicc aac ^cq ju»c ccc tog ttii 3^3507 
AJa Pro fctis Hio Trp Ar.iy ci^ Tic^i Uiu TtTT AdEA-j AJa LyE Pro rptp Leu 
il70 4175 41BD 

etc srjt gsiB atg aac aac aac atg o^a racx-- cat otc Ofcc ctg- cat tac 33555 
Leu Gl.ii ^'*eit- Afij^- A.':ii Met Gly AJa Hie Hie "ri'ir Leu oLb avr 
4iaEi 4190 4195 



CV^ ar?D tec gVj:: <7i?^ ttt tgg Ctg geit griitt yOc; g(;^4 ^r;^ 

/org siea- Ser Viil oln Ffae Trp lie« Asp tsdu L6*s Alo AJ^s AJa lisu Ala 
«0D d20i> 1210 42i& 

c*;^3 caa aca oca C5tc tgt tc^c eta ccc ttc ccg ate cat acc ctg 
'i^ir i"^y niji Thr no v&l f^?i ayr Lea Pro Piie Pro ^1 uLf, Tlir 
^220 4225 ^230 

tgcj caa aca ooa qcc oag Qa.t oaa ate aqc tigc aat aaa tta Qjtg xc 

'^rv Oln Ttii' Glu GJli Ari{^ (tlu lift J3Wr; (^y Atai J.ysT; tlir 

4235 4240 4245 

act tta c?5t tt*c BCt cac qi?c escc trjg gat sroa ogt t^aa cgjca qas ttt 
■Rir T/^jj Arg Tyr Ala Gly Ala Ttp Awp G]y Arg Rbj Ay.^ al.u Wv- 
^250 4255 4260 



33*51 



33&99 



33747 



cgc Qqc ttt ggc: teit gtt ocig 0213 cica qqc cigc c^It cnn ctQ i?cc cac^ 
Atu rjy file Gly Tyr val Glu Glii Asp Ser llis Gdi^ Leu Ala Glri 
426& ^270 4275 



33755 



ggc aat qc?T cc?3 qoa cgt aca tcc» coc? qca ctt aoc aaa aac tgg tat 33B43 
01 V A:^i Aly Pro Glu AiT3 TTm: &.^c Altt Lcm Tin lys Aeo Itp TVi- 
^:^I5£] 42«S d^<?D 

gr^ aec gga ate cct gaci gta gsc aat aog eta tjct goc ggg tat tgg 33891 
Alci rnr Cly 7lc- Pno Clu V«i Asp Aot TTuc Lcth .S<?r Ale» Otly 'IVr w 
'4300 ^4305 1310 

egc (KJt ??at aog oaa cjct ttc crct ggt tt-t xn era cnc ttt cvct etc 33939 
Ath Gjy Asp Otu- Gin Ala Ttir Gly Fhe itn- Pro jiis Phe Thx Leu 
^^15 43;!<J 43;>s 
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tg^Ji tiAA gag ggc asta oat srtt cwa r^l^ aca coci gaa gat cac aat 33937 
Txp liys Glti Giy T^= A5:p V£sL Pro Leu "1^ Eev GLu ftisip A^p Kis A^si 
*3:30 4335 

citi^ lAcr 1:^ tta aac oca ctca aa*^ ^1: oca ccg cgt. £agt 3^035 
bau lyr 'l*rp Utm fcui Af.x^ Ala Leu glLy <oiln Leu Arc iJer Ciiu 
asjujs 4350 

<:\'jC tac ggg eta oat qcjc j^ec gw? f"-^ fiftg aag ate ccc tat aca gtg 24DS3 
Leu 'iVr Oly l£U Aep dy Ser Ala Glji ULn Lys lU^ Piri Tyr TTir V^l 
42560 4555 437D 437S 

act toe r;ef: irTifl caa gtg ccac caa tta c»a oiat dwsc act ooc ctt 34131 
'mr Glu SsGT Ai:>^ 1^ Gin Vol Antj Gin Gin Aep Aen Thr Olto- Leu 
d3e0 4335 4390 

toj glig etc tgg tea gtg j?tg ptiK* jajgt ^:st iigt tftt oar tat 34L79 
Sex Pro V&l Leu Tip Alp S«!!3" Vtal ifel GLu Ser Arq Eer Tyr ]tLs Tyr 
4i9S 4400 ■ 4405 

CJgt ate ate aic^c ccc .rEX£* t^jc c»t cag ©eit ?ibc wet ctg tec 3422? 
Glu Arg lie Il« ©ftjc- Ai^j P'iCi^ fiLn Cys hen Gin Aep lit Thx Leu Ser 
<i410 4415 

aijt eta ttc- ggg ocm* ctg Cftjg gt-t. tea gtg caa tat occr :^d2f7!i 
Ssr ASfj lisu Gly Gin Pro lye CiLn VfaiL Ser Vlil GOn UVr Pro 
4^25 4430 dd;2b 

ogc ogc aat aa£2 cca aca £aec aat ccij tat- wjC--« Cl-ft CCa gwt 34333 

Arg Arg Afi^i li^ Pro Thi^ Asan Pro Tjnr Pro Aep Tito Leu Pro Asy^ 
d^^O 444 S 445g 4455 

act ctB ttfc ero wgc ?«gt tat osto gftc rjaiia caa caa cLa ttg agg tLa 3^371 
ffint^ l£« ftie Ala Ser Ser Tj^x A^ Asp Gin CSln Gin Lbu Leu A^ Leu 
4460 4^65 4170 

lacsr: tac ca^ caa tee agt tgg cat cat eta att get aat ^jna etc *3a 3441S 
l^u" I'io- Gin GlUi Sr?r Tip Eli:? Hi:> T*iaj litfi Als Afa:i Glu Ltfaj Axg 
4d7& 4480 44 

gtg tta gga tta D3g gat ggt aca agt qat oct ttc act tac gat 34467 
VlU Leai Gly I^c-u Pro ^£s> Gly Thr Arg Scr AJi? Tin? l^rr fe^ 
4490 44£lB 4S00 

get aaa esc gtfi cxrt Qtt <^t ggt tta Mt ctg gna get ctn tgt qact 34515 
ftU^ itie \fal Pro Vfcji A*i> Gly L£U Asa Leu du Ala JJeo Cye Ala 
4505 4510 451..'i 

?«ai. ngr: ctg ^1.1. yi;^; g^jf. g^h mmm <:i:t cgcf gaa t£sc etc afijc ciag 34Ji6:i 
Gslu Aan Ser Leu lie Ala Asp Aep l^s iTo Am Glu IVr Leu Asn Gin 
■4520 4525 4535 

caa cga acg ttc tat sec get ggg aaa aec nat gna ana Mt ccw Dtoa 34611 
Cbi Are 'Star Ftit^ TVr Ihr Asp Gly Lve Thi A^Jt^ Oly r4*?s A.w Pjf> Tlir 
Jbdo 4S45 4550 

ctg aaa aca ccg aca cga sag get tta ate qcc ttt acc qaa o«:?r 34G5S 
Pro Lexi Lys Thr 'Jhr Arg Gin Ala Us* TJ^ AJj* Pbe Tt^; alu Thy; 
d5S5 4S60 45^5 

geg cjta tto ncr? giw* tc-t cto tt^s toe gca t^i. cj^jc: yyi. ^ra^ :*4va7 
Alw wal Leu Ha: Glu Ser Leu Sesc Ala Ftse Aep Gly Gly lie Ihr 
4570 a57«j ^5I4U 
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^5f:fl ^ass tt3 coc giggfj ctt cwixi can oca gga tac cafl gaa 
Kfo ASP cslu ii^u Glv Sjsu Leu Ohr Gla^ Ala Oly lyr Gin G^ln Glu 
45^5 459^J «95 



3^755 



oct. t^t Ct^ ttc CC« ziEjt pgjc ^aa aac caB g(U> bgg gta oca cgc 
9xi^ oyr Leu Phe Pro Leii flir Gly Clu Asxi Gin Trp Ala Arg 
4600 4605 4ek]L0 

c«>a t?tl. acf: grtt toe QS^ act gag gta <-^vl ttt t-ro csgt ect gtc 
Lys GLy TV^ Ttir JSsp lyr Gly Tiiar G*Iu VzH CSin Ete Trp Ait^ PtT> Vol 
d6:&D 4625 4G30 



gca ogit asc ace- cag lil^ gg^ ^uiiDt ac^ act. eta SAtA bg? oat 
Ala Gin Atg Assan nir ClJi Leu llir Giy L^^-a Tt«- Ttir Leu Lvs Trp Aq) 
^£35 4640 4£dfS 



acc cac t^c tjgt gtc ate act <:a?A Ace cas -gac gcg get. ggL tto ^Mi^t 
'Ihr Hi& Tyr CV^ Val Lie Itir Glii Ttir Cil2i^ Adij Ala Ala GQy Lfiu Tt^r' 
4 ESQ 46^5 



149dV 



gtc Lea gcc E»t tat gac ngy cift ttt etc ajCE cct atg etc act 
Vhl f>f^* Aia astj Tyr A;^ 'i^p Ara Fte L»i Ttir pro >iet Gli:* L£u ifur: 
4665 4670 lti7B 



g^it Cite J^^^ g^^^. .*«it gtD cat ate ata ai^ ttg Eiat ^jcq eta gigp* cgc 
Asp lie Aan Asp Asn v&l iilfs ll<& ILe Leu A^ /iIj^s Gly Aim^ 
d^D 4665 d6»D 



35043 



cct gtc act caa cgt ttjc: t€5 stD gaa aaf. g^it gtg bc^ ggt 
Pro Vsil ITiT C^ln Aro l*pe Gly rifi Glu Asn GLy VfeL Ala Ttir <^y 
4700 4705 4710 



3509L 



tstc tct tcBfe cca iTaa gca avs-s to^i tec act oca ccb gbc got- gtc aat 
Tysr ser £er Pro Glv ALa Lya Kra Ei^ T*^ Pro Ito Val Asp VftL Asn 
4715 4720 47;?^ 



ycb ^ck; etU yctw [^txj ac?c <^ cca cljt cct otc gcg cag tgi. gtc 
Ala Ala lie Ala Leu Thar Cly LEti Pno V&l Ah-ii Cys Leu Val 
<J7:3D ^735 ^740 



351 B7 



tau eo9' CXig sp?^; *J«cjt tgg stg ccg cl;^ ttc ogt cag gaa aoc ttc asac 
Tyr Ala Pro Asp Sear T^ Met 1^ Ijbu Ete Rly GLri GLii Thr Phe Aan 
4745 47SQ 4755 



35235 



ijtotJi tla a£?g cseg saa caa aag etc ogt gat tta ogc? att ate 
Thr Leu Thr Gin GLu Gi,u Gin Uys nhr Leu Aarig Asp Leu Arg Tie lie 
4750 4765 4770 4775 



352G3 



iCtt g£ia g^t tgg f=gt. ate tgc gc» i^xj get ctrc cgc ogt tgg cssa 
•iJir GLu A^ Itp Arg iLs Qyi Ala Leu Ala Arg Ainj Arg TVp Leu GLn 



3G3il 



^t. Afsa ec*:: sca oca t.tjy gtt aaq etQ tta Acn^ age ate 
Sei- GLn 1^ Ala Gly Thr iiBu VaI ?j.y/.; Lcaa Leu ihr A«2n S^r Tic 



4795 



4SO0 



qgt tta oct cce cac etc atq ctg get fli-aj cjiw.'. cgt tat ogt. :iSJ:?7 
aiy T^j Pyvj Pt^^ His Aen Leu I-cw Ala 'ihr Aap Aig Tyi- Asp Arg 
4BL0 4015 4B20 

gat tct QrZiZL caa att <':gt co«i cab qUc gc&. iite ^t- gat SMt ttt 3b4*ys 
Asp £er Glu Glt^ Gin He atq gLti GJji Vj^i Ala xte :iea- AEp Gly Pim 
4B25 4£l30 433^ 



OTC cgt. ttg btg gc<3 act guj egg cat najg gca ggct g«<* tjgy 35523 
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cay Ar^r Lew Leu Oln Alcft Aln V^l His: Glu ALa Gly Glu Ala Tip 
4^1) da^B 48S0 4B55 

cfgt. aa:: caa ^&cit tct ctg gtg £ico at^r ona ^jat £i^cc oaa 35571 
Gin Ars Aetj (sQn Agp oiy Sdr Leu Vhl Via' lys Olo AS£i Thr 1^ 
4B€(i d&65 4670 

&2g cgfz tgg gog att acn ocia ogjc act oaa tat «jac asit aaca gOT cap 
ihr Apg Tip AJa lift TJuf Gl.y Attg ttu." fills ^yr Jksp aot 3jy« Oly GJjft 
4375 48BD ^365 

gee at^i r;ga >3<f:L tAL ^!r:i^ Ijit I.Ijx t^l'jCi rtaK 149 cyfil teb gtg 35^67 
Ala lie AiTj llir 'ivr OCln I'm 'jyr \^ Leu Asn Asp 'Ttp tog Tyr v&l 
4^ d6db d9CI0 

£a.yt yttt gsc Bgc: g<y. a^s aa^s ^tx: Uit ^ic e^^t i?u:=t cat otc tat 3571!> 
Ser Aap Asp Ser Ala Arq ij^ Glu Ala 5yr Ala Acp Ttor Hie He Tyt'- 
49C55^ 4910 4515 

E35t att gyee Cg& giwi. ?vtc dm gtt ate ocg gcss. acw* one tOEj eta 3S763 
AEp Fro He dy Atr du lie Gin He Tta: Ala Lye Gly Tirp Leu 
4920 492^ 4930 4S35 

c^Qf csig aac caa tat ttc ccg tgg ttt crtg «igt g^a ©a^t- gwa jwat 3530.1 
Arc Gin ten Gin T^t Rw ^ 'i^p rti© llnr \totl Ser tSiu Asp Glu Aan 

4ddS d95Q 

gat ttg toe grt ©dci etc gtg taa ttgaatcaag attogctcgt 35658 
Asp Leu S^r AlA Ai&p Aid Lou Val 





4955 


d9€0 






rtaatgttaa 


•nna^^o^ata 


taatatacct 


aatagatttc 


gagttgcagcr 


goggci^isca* 35918 






atagataact 


aCfrt^tgij 


ggt^gtgaa 


agicagcv^aac :S597rs 






ScutyiAJLatck 


aatdJtgfMnpf? 


tiiccittsTtga 


gttctaaata 360'3B 


^fagtagcagic 


atattttjKtt 


er:Cti.l:totl. 






ttCTcCETtaaa 36036 


aatcLgtjCBt 


catgaig^ct 


aaaaata^dica 


ar^LtljilJiitL 




natcoatoAt 3<^158 


rrvfcal-t-ajjjt 


atgttBtaga 


atcrqflfttcfl 






ijizaaaatataa 3C213 


acatcogscat 


cggtaataa^ 


agctgatjgcr: 




IjCfcttttbat 


oocaagtQOC 3E278 


atatgticzcat 


actcaatacc 


agaataatta 


gatataccBA 




tttasjtoiscct 36338 * 


€4att)C^tatt 


tt£ikc«attact 


tt.tccLataa 


cgcti^actta 


aatt^trmct 


Atw^^t-tctx; 363?e 


gti^tqaaat 


t^taaaaiQitt 


aacattatoc 


^tagataaa 


aacxstycftg 




aatoggctct 


tttctcoctt 


ttttataaaa 


ttaaccattc 


i^tl-.l.lW;«iiJK: 


-uttoitttaicn 3&S1B 


tncragcaat^ 


ttgaaAc^a 






t*actccc:i5C 


aataccscTttt 3657S 






catti>artoct 


R3K«rmat£>a 


tcttaccatt 


aataairaacc 36638 






•:;?Jit^y»atMD 


cxxJtoaiSMC 


£X£K?gtana£» 


a^^aaataac 36&9& 


Acy'rtflcra^La 


ataattttcg 




tctCTzioaga 




aaaceatsat 36753 


aicjFiani^tsAT 




aa^tataat 






CMgaeu3ugcQ 3Gei8 


a j^ar .Sharon 








uCntoattaa 


c»umgaAtat 36B7B 
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tttcci3f«ca qat£*aaai*gT l^^gacaaata tiCKia£Kjat:£%a t.ttattt^?ti« tatet>gnt-«g ^b9.^^ 
cittatWiAAt aaraacatnc oatal-^t--&taa aacaa::actg gcnt^atatto atr^cntato •3699& 
<^tcooct.i.l39 nttg^attt rinociwtv»9Lj<: aiir.LtLacat aal^atata u£ictc\cBf^w:nc 3705© 
aoufetaal^r^ atasktaniraca tvftgiHOTiulil.l. Ar.ttttattB aaacaataac ^jaagattcat 371 IB 
t^tatAag^ aatoaziaana <a«ci^l:igatga saataatttt ttoCttccat taattatatcR 37179 
/«:flij^5tia iirfcKCUv^&tii- fl^-^tatcaa CgctactMt ognnt^ct^ ^itfjt^Jinwt 37?3H 
caaatcatat ^%^ilJir.r.cicac ccx:tgaatt5a tpopg ro agzi ogpftc»gw»c-« ^^tjca^ww^U :v;2y5* 
aa^Ka£iitigt^ ^aaaactta attc^atfka ocaaaatcccs ^it.tfli;*J9CWA ^<*j«toclv?t. 37:^?>li 
caaaaawww: Acagati^a cigtaattjcnzi atiwtttwpvscs t-U ^i«tt«tt%w;t*it. 57-^113 

c^zjncSMB;^ ataactatcg ?iimiaTOKict. nt^wtiwiAM Aflflarj^fllJif: 9rtt:tj*!?tjtJM<i :i747Fl 
/;yhKjc^tt:Ct ttooctaiDca jM>g5?i**C9?il- gBttgiBBttc ccctttgcsg aagc^aa^ .iVbSli 
ccttatKCittn c^t^av?M^:AAft AiJScaiataLa taccatcaaa gatatjjs^cag taaaataaaa S75SB 
tc^ttLtato ta^CBljcte gaataataat attig^jaajgat saajittacta aaacctcaaa 37&5S 
gataccact?T £w»;.^l.l;9CiS3cr gaAgtaatas aag^aaagg satataatga catttttatt 3771& 
cccagaogca c«ttbctbtta iXxjt^aKicr.r.t: atattocaag gcatcagoga ttattaaatc :i?77ft 
i-jsLactgcct ctet^uuac«y? <«wJwiibcl;4aaa taatgtcctt pgftgaatctt tagggaasttt 37836 
ogtccCOTM t^^J'AfA jbatEsgttflcrt gaaaacaata cattqatttt taattaeata 3789B 
ctrssf^fg^^^ ^ctLaat^ at^gctacttt ^.tctccoogt ctttc«3»CtJ:s2 37P*B 

■:2L0:' L2 

::iii:-]> Ftotortobdiis lusnmascens 

ftet Aai^j life ABp Pro Lys Ijeu Tyr Gin lys itir Faro WaL val Aan 
1 «> lo 13 

lie 'IVr Afdi ?L^.Tg <»ly Ljsii T^u* lie Arti Asn lie Aep Fhe Uls Ai:g 
20 25 30 

•rtu Thi- Ala ulv Aep llir ^^sp lie Ara lU^ Ha Arg Gin TVr 

35 

Tisp Eer LiEu Glv His Leu Ser CUji Sex Tlv Px" Aiy )<tu Tyr Citt 
SO 60 

Al5i bys 3.y=: fS^r Pi>f-. ]/iiJ Tvp Gin Tyx As^ Tttt- Gly toil 

ftS 70 7? BO 

Ti« Leu CVS "Ita" UUl Scr Val ASl^ ^^Iw fiJy Artj ViJJ T^r l^jj 

90 SIp 

Asp Tlih Glu Gly Arn I'rxj Jjeu Le^ 'Jtu' Viii 'Pai Ala Thr Cly ViiJ IJl^ 
LOO '^Ob 110 

Ci^j Ttir Arq dn TVr" Glu 'niy ftor Scir Leu Pro Gly Arij T^u isear 
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120 



Viil Thr CL\» <iiin life Pro Glu Lys 'llir £*ir Alti lli® Thr Giu Ans Lex? 

135 * 140 

Il<:r Tip Ala Gly ten ber Olu Altt CQw Ty!> AiJi> Hift AiC?fs T^s ?»la 

14 & isa 155 160 

Gin Cys His TV^ Asp Thx Ala Gly Veil Ttir Arg Lieu Glu Ster 

165 m> IVi 

lisu Ser Leu Ihr dy TSit" V)al I^fu £er GLn Sea: S&r Gin Leu Leu ser 
lao 165 190 

Asp ihr CJ n Glu 7il& Ser Tt:p itu: Gly Asp Aen Glu Ihr Ttp Gin 
195 200 'AO^ 

Jfet Leu Ala AsT> Jisp Ilo- TyT TJrr Tlir \tfu St^r. Ali^i Pfjft Af^:> 
210 215 220 

Ha: Cly Ala Xjsq Leu ito tsOn Ihr Aep Ala I^ys Gly Asn II-© Oln Arsi 
225 iiO 235 240 

ijeu "Shr Tyr /ifi^p Val Ala Gly GIli L^u Abh Gly Ser Titp Leu Tf^ir L£aj 
245 555 

Asp taOri fro Gly CI13 ^feil Iia TJ.tr Ac^ S3«^ [*W ly^ 6«ar Ala 
260 265 270 

AJet Gly Gin lys Leu J-jfg Glu Glu His GLy Asn Gly v&l lie ifhr Glu 
275 :a»0 2BS 

TVr Sea: Tyi- Glu Pro r^lu Tinr Gin Gin \jeu tie Gly Tte Lys Ttir His 
290 295 300 

Az^ Pi.^ Ser Aep Ala 1^ val L£U Gin Asp lieu Arg rryr Glu Tyr Asp 
205 MO 315 320 

Pro Val Gly Aot \ftAl IIl^ Suit He Artj Actj A.15P Ala Glu Ala Ifw Aiy 
325 330 235 

Pte 'Trp HiB Asn Gin lys \tel Ala Pro t^u Am ttsc Tyr Thi- Tyr Asp 

3^0 345 350 

Ser Lesw TVr t?ln Lsu II-? Ser hla Thr <:;ly Ar^ OLu Met Ala Asn Il« 
35S 360 365 1 

Gly Gin Gin tSer Aen Gin Leu Pto Sej: Leu Thr Leu Pro Bex Asp Aari 

3-'7D ?r>^ 

&M Tyr TSncL /if an lyr* TJtt- Ary ThTc TV^' "W^i^ 'Ti'^ A-'j^l^ fi*-V C»ly 
r5«^5 390 395 400 

?»£yn L^u Thr Ly& Jl-e Gli^ .Ser Ser Pro aIa Hnr Gli') abi^ asi^ oyr 
405 410 415 

Thr thr Asn He ihr val ser Aen Aig Ser Asn Arq Ala Vtil Lieu Ser 
4*20 435 

'Ihr Titai Thi' Glu Ae^ Pro Ala Gin \^1 A:;(i ALii I^u P)^ Asp Ala Gly 
435 440 445 

Gly Kiw Glj* Aax Thi' lsaj lie Sor Gly Cln Afc^r j low Aisri Tip Afi^ Tl^* 
4SD 455 460 
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Ajg Oly Olv Cln His \F£iL llir Leu Vtel Lye Arg Aep 1^. aiy Ala 
465 i70 -a^?* iao 

ABft -Rsp As|> Arg Gdu TVfj T/^ Arg TVr Scr Ser Jisz> Acq Ang JU*: 
486 495 

JjCTi IaV^ Jlo Asn Olw Gin GOjt ihr Ser Sftr ESctt Oln Tbr Ai-g 
GOD 505 Gl{) 

Ilc" ivr UBU rro Ser Ijbu GLu T^i ftrg 'ite CsCUi Asii Bear nir 
blS ^0 b25 

Lie llir 1^ tep Xl^eu Gin V^J Thr Vol Gly GliJ Als GJy Aid 
5iU 535 

Ala Gb\ Val Arg Urn His '1^ Aep Ser Gly Gbi Pim Olu Asp lie 
545 £50 555 5W 

Asp Asn Asn Oln Leru Arg Tyr Sat '^r Asp ften Leu lie Gly Seu 
565 570 575 

GOiTt Leu Glu Leu A.'-p .S;;r Lys Caly du He lift flftr diy Olu lyr 
580 b*45 S&O 

Ti?r Pre? Tyr fSiy Ciy l^u" Ala l£u Trp Ma tla- Aru li/s Arg Ttir Gly 
595 60<? <i6b 

Ala Ser "lyr Lys Ttu: rl$ Afg Tyr Sea: Cly Lye Glu Arg Ala Thr 
610 61& G20 

<Sly Lflu Tyr Tyj; Tyr Gly lyr Arg lyr lyj COn Pro 'licp val dy Ar^ 
^26 650 63S b^O 

Tly> LietJ fScT Ala Asp Pro Ala Gly TJir Vtoil Aep Gly Lea AJiii uyr 
6db 650 6&B 

Arg Vr^ Vol An? Asn Aflti P/ti V«I Thr liS!>j Ijeo Aflp Pa^ A^sp Gly bsu 
€60 €65 670 

Met Vto Ttar He Ala Gau Area He Ala AIa ri;v Oln Lys Asn Lys VaI 
«15 AKn 66s 

aLb 74fit^ Svr Ala Ser Pro Tlir Ala Ttir Am Wsl Alii He Asn 
65D 6S5 700 

Il<5 Arg no Pro v&l TOtt pto I^yB Rt-d- Tte ptT? Lys Ala Ser Tftr 
7€5 710 715 72D 

Ser Ser Gin £e£r: Tin: Ihr Ty^c Pro Ht^ 3.y3 ftar Ala Ster iLe ].ys l»rc^ 
72S 730 715 

Hhr 'fhr Sau C^y Seir Ser He Itir ;*ia Pro Sur I'ro Vei Gly Af.^i 
740 745 750 

Lys Sou TVjfl: Pro Glu Hk Ster Leu Kro Glu fiftr' Ttn* Gin aer Asti s^r 
7fib JBn 76S 

Ser Ger Ala Tlr.i Scr lln- Aen Ufii CIti Lys Lvs i?ei' Pbe fBir fjoui ivl- 
770 775 7R0 

Arg Altt Ayp Asn Arpr iSer P<ie Glxi A-sip Wet Gin Ser l.yi^ R*? Glu 
7B5 790 735 HW 

Gly iiys Ala Trp Tl^ P/,xj Leu Asp Tfa [.y?.; t4ct Ala A-rg Gl" Wkt 

005 ain ai5 
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Ala Ser VsL Itie lie Gly GLn Lya Asp "rtir Sfei' Atsii Lou Pto Jjys Giu 
820 B25 

Thr vaJ Lya Asn lie ten l^ir i:rp aly •rtir Lye Iro l.y^s ijiw Asn Asp 
e35 (t40 84 & 

Leu Sifir 'Hir Tyi- He Li^^ Tyr naf^- Iy& Jjys Ser Hir VaJ VteJ 
BSD 655 AtiD 

Iter Tjnr Ala T»<? A^.ai TCbc Qlvi Ala C^y Gly Gin Sar asr- Oly Ala 1^ 

sfis &70 B7S aso 

Ue^ «iB Calu ILg ten itet Asp Us^ Tyr Glu 3^ O^ir He Asp Gly Glln 
BBS 690 8*5 

l^yTS Tjfifti Aa^ Pro Ijsu Ptvrj Ar^ G]y Arg Ser Jj^re Aop Rxg \teL Piw &sr 
900 90& giD 

Ije^^ Leu Lesu Ayp Thr Olu lie Glit Tlir Ala He He Al^ Leu 
915 92n 925 

Jisn HiB Gly fro vaQ Aha Asp Ala Gbj V^l. f?r:ir fbe Leu Ihr Thr Tlr- 
SSt> 9:2.5 C|40 

Pro iJEt^ Aisn V&l Lye Pro Tyj; Arg 
»a5 950 



<210c- 13 
<2ll> 2S22 

Met Tie lAi Ly*i GJy lie Asn Met Asn Sisr Rtd vtal 1^ <Slu lie ped 
15 10 35 

Asp ^feL Leu ly& IL^^ <JLn Cys Gly P*>e Gin Leu l^ta: Aep llti fites^ 

26 30 

Miff Ser 9er rtie Af;ri OLu PSie hLb Gin (Sin V&l Gdu Kis Ser 
35 4n Jfi 

•iTp iter Olu Ala hIb Lenj Tyr E-Jis Asp Ala c»ln Gbi Al^ GOn Lvb 
SO Sb 60 

Asn Arg Leu Tyr GJu Ala tan He Leu ijye Arg T^^ Asn 1^ Gin 
70 75 RFl 

3U^ Gin Apo Ain Val Wis LEfu Ala lie VaI. Al6 fto Aen Ala Gl« ]jeiL4 
Hf> 90 55 

Ily Oily Tyr Ayci Asn GOn the Eer Gly Ar^^ Aiia Sfer Gin Tyr Va] Alia 
100 10S> 110 

Pro Gly rhr Val fiksr Sr^r- Met L+» Ser Pro ALs Alu Tyr Utii -Ihr Glu 
115 120 125 

l>fr- Aisg Giu Alfli Aiu Aan Leo Hie iVLa Ser AiFp Ser \fel Tyr Axa 
130 13b 140 

Leu Asp TJir Ar^ Ar^ Ltd Abp l£u Ly» Ser M?it Ala Leu Ser Glyi Gift 
IflJi ISO 1&5 IGO 
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AEn tJet Asp Ttir Glu Ser nhr beiu Ser beu Ser JSsn Glu l£ti LeSJ 
Ifx-^ 17D 175 

T<%i fiJ.iJ Srty lie Ly*: Tfsr- Seor Lyi^ 3j«ij Afjjp Aaci TVr 'fhr OOn Va.L 
IBO 1BI> li^U 

WBt Glu Met Ssx aLq ite AarsT I'SCD i3er GLy Ala Tiir Pt*^ IVr Kis 

2DU 205 

Asp Ala Tyr Glu Aan >tel Jftng i^b v&l lie gIji Olri Asp Gly 
TjO 215 22D 

1^ Glu Leu Asn Ala Ser Pro ?aa r]i; Alis. Gly UEXk Met Hi£ Gin 
225 230 235 240 

Ala Ver usu Leu Csly He Asn Ala Ser lie fer Glu Lou l*hB Asn 
:24i!? 250 

T.JO. l*:!u 'nw/ Glu GiJu 71o T^kt Cd« Gly Aso Ala Glu CSlu Leu Tyt^ Lys; 
260 265 210 

Li's Asn Itc dy Asn ile Glu Ala Ser Leu Ala GLu lyr 

275 260 285 

Leu Ar^ Arg -Tyr Ivr Asn Leu Sex Asp Glu GLm l^j Stir <31n lie 
2S0 395 30D 

Gly uye Ala Sex Jisin Hy^ Gly CLn Gin GLu 'lyr Sex Asn AfaQ GQii Leui 
305 310 315 320 

lie Thr ItfP lie Val Asn Ser Am Aep Gly •fl^i- Veal Lys V&I oyr hrq 

3<25 i30 335 

Ile ttir Am Glu Tyr Utix Tiir Al* Asri liln Val Aep du Lc*i 
^60 34S 

rtje iTO ']Vr Glv Gly gIli A5;n iVr CUn Leu Asn TSo: Lys rt>e Ly:; A-;^:^ 
SSi 360 

ifer Arci Gin AEp Via I 'lyr lisru Ser Ile lys L£aj At^ A-fp l^ys Ainq 
:i7b 3VS 330 

01m Uya lie? ArT2 Ile Glu Gly 7da Prn> Qln VfiL Asn ile Glu Ty?- &-i 
3B5 390 39S iOO 

Glu Hi-B lie TtTo: '±£si S^tt Thr l*hr Asp lie Ser Gin Rk.^ Glu lie 
405 41.0 dlS 

Gly T<nj T^]yr Arg Val 'lyt: Pro Ser Ser Tr^-* Alct 'jyi" Ala Ala aJa 
d20 425 430 

LyE Pbe Thr lift Glu du 'jyr Asn Gin Tyr Sesy P4>e leu Lgu lys Leu 
435 440 d4iS 

Aan Lys Alo Ile Arg l^-u Sk^r Arg Ala Itur Glu Leu SSear Pn^ Thr lie 
4bP 4hh 4GD 

fjLni CJv lie \fal Arg S^r Wet I Ai;u Clji CLn Leu Asp Ile A-ai AI-m 
46G 470 47S 4 BO 

dii vsl Leu Gly Ly:: v«l lihe Leu Ttir lya lyi- Tyr Ifci. ajn Arg lyr 
4B5 43n 4i^5 

Aly rie Aor> A^.tt tlu Tyir Ala Loj Ile Cys A;Lin Alca Leu ile itei* 
500 505 bH> 
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Gin Arg JSfer 'JVar Asp Asn Gin Fit> Scr CLn Hid Asp As:g icu Ate Asn 
SIS B2C> S25 

TSir 9ix> Leu ijfiftJ A<a^ <5ln lyr: Pi^ ^^er Thr; Gly A^i? Glu ll-e 
530 SdQ 

Asp Leu fsBR Vxo Gly i*Er l-hr Gly Asp 'incp Anj lys Ser val Leu Lye 
«^ tiSQ 555 560 

Ai5g Al£s flier ABD lie Asp iLfi Ser Lau 1^0^ Arg Tjeu- T^i Lys ric 
!i6!> 570 CT5 

•RiT- Asai Ki3 A3n A£5n Gin A^-p Giy ].y;^ TJ.y i^.y As« ten Leu Aan Am 
^30 SBB B90 

Leta Ser Ast> Lfi« lyr lie C^y Leu Leu Ala Glu He Ki-? <3)n Lru 
-S!*5 600 

rrtfn: He Asp GLu Leu P-ep Leu Leu Lfiis \fca Ala Vol Oly Clu Glv Giu 
GIO 615 620 

Thx ^1 T4aj Ser Ala 7.1r? 6cr A-sip Iot OCln Leu Ala Ala Leu He Jit^ 
£25 630 63S &40 

Lye Leu Asn Hit He 'itsr >fel Tsp Leu GLn Utii: Gdn Lysj T^p Scr ALa 

£55 

Et^ Gbi Leu Ete VaL Met Ttar 5er Thr Seer TVr Asn is/B Thr Leu Thr 
6&D S€S 670 

Pro Giu ILg Aan Leu Leu Asp Thr V^l Tyr HL*5 Cily Lajs Gin GO.y 
£?5 «©0 665 

Pte Asp iys Aarp Lys Ala Aan Leu Hia v»l Mc?t ALa l>ro 'iyr He 
690 &95 700 

ALa AJa "ahr Leu GIj^ Leu Sei- B«if OJ.xi Acn VhL Ala His ser VaL Lpu 
"?05 710 71S 7^0 

Leu Ti:i> AJa Lart5 Jjb^j Lys fc^ro GLy Asp Gly Ala Met. Tlu- AL<a GLu 
725 730 7i5 

lye rtie Tcp Ast^ Trp Le*i Artri Tlii Gin Tyr Thr Vro Asp Sei- Ser Glu 
"740 715 750 

V^jl i£u ALa Ttuf qJji CLu Kis He VSal Gin Tyr eye Gin Ala Iaij Alo 
755 760 7&5 

GLn Lq^ Olu fcfet Ifeil 'IVr Hie ser Thr Gly Ilrss ASvi Civ Assn Ala lt» 
'P'JO 775 7a& 

Arta Leu Ftsi^ \ftil Thr Lvs l?ro GLu MeL Phe dy fer .t:?;T- Tlir Glu ALa 
7B5 ?90 795 

V-isl Pro Ala His; hfi^i A^Iti Leu iter Leu He Met Leu Thi Arg Tf^. Al^ 
BOS 91 n 815 

Aep ivp Vfcd Asn Ala Leu C^y Glu L^?: AIa fitftr Sisr Val Lo^i Ala Ala 
B20 B25 SiD 

F^ie GLu Ala Asn t^ei: Leu Tin: Aiy Clu. C]ti ]<tu A1« Asp ALa Met Aan 
«35 840 &dS 

Uu^ Asp Ala Aan Leu Lau J<ru CsLn Ala Ser 'ihr Glji Ala Gin Asi^ >lift 
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aso ^s.s a6o 

Gill His T«i Pro Pro wal TS^r c^ln lyf: Asm Ala Piifi Sfce-c <Vs Ttp Ihr 
*^E> 670 875 HHO 

Seof lie Aep Ihr lie Leu dn Ttp Val Asn ^tel Ala GLo Gl« i^aj a->i 
nj^'i 8S5 

\ftnl A1^? Ptti Gdn Gly v&l KttT Tila Leii v&l G^y Uiij Aj.^ T^jt lie Gin 
9O0 905 &10 

mi Aan OJti l.y-^ J Irt Pro T^tt TVt Al?t Cln Tip Clu iter AIq GIv Glii 

S20 925 

lie Leu Thr Ala GIv Leu fisn Ser Gin Cain Ala Asp lie Leni F3l c Ala 
930 9^5 940 

JFtxT top Cxlu Sftr JSiy f^er Alia- Aln L™ Sstt Ihr 7vr 'iVr lie Axg 
945 950 955 96l> 

GIjj VSil Ala La^ r*ro Ala Ala Ala lie lyb Set Arq Asp Aep i^.^ t^t 
96£ $7D 97S 

Gbi ^ryi liE-j Ueu lie Asp Asn Gbi 'U&l Ser Ala Ala iLe Lys -fto: 
9ftD 990 

Ar9 lie Ala <3lu ALa lie ALft Ser' He Glri Luu TVr Vol Asn Arg T!la' 
995 lOOO 1005 

Lcxs OILS tei) Wftl GlLi Glu AEn Ala itLa ^ csay V&l lie Ser a™ Gin 
lOlO 1015 

rt-je Ite lie A^ Tnp Asp Lys TVi' ^an U/b Arg lyr Sfer Tta- Trp Ala 
025 1030 1OT5 i{i4Q 

Gly Val Ser Gin Jjau val oyr Tyr Pxd GIu Asa T^r He Asp Thr 
1045 1CI50 lOBb 

bJec Arg sly Gin li^r, r-js^l; r^fct. A^ Alci Leu Leu Glri Ser I 
aOGO 1065 lO-JD 

Ser CLn Sot Oln Leu Asn ALa A^ □to: \nal du A-a^:! Aid Fjkj »Bt Ser 
1075 mgo 1DB5 

Tyr Leu Thr Sei Ph£- filu <31j^ ALrt A?ai Leu lifs Val iLe Get^ /Ua 
1090 1095 IIQO 

nyr A.':p Afirt TJk? A«n A-^n Asp Obi Cily Leu Tbr Tyr Ph#t Ho OLv' 
'its lilO iiih> i:i2D 

Leu tier c^u 'Stir A^ Thr Gly -nlu Tyr Tyr Tr^ Arg Str Val Aep tJis 
1125 mo 1135 

Sear Uvb Pte Ser A-sp Cily lys She Ala Ala A'j> Alea Tn> Ser Gla Tiro 
ll^<> II4S iiJiO 

Hi-: ].y's il<a Afi3i> Cy^^ Pru 3Ie Asn Pto 'IVt Axg ser Ha: lift Axig [^v 
1155 1160 1165 

Val i*3t Ty^ ^*er Arv] Leu TVr L^ L£u T-rp Leu fiJv Can Lys GIlj 
1170 1175 iieo 

He l*ir Lya Gin Thr Gly Aui i3er pyK Gly lyr C5ln Ttar du 11 ir 
1B5 1190 1195 ■ 1200 
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Afip Tyr Ai^ "Vyr CSIm Liuu Lyjs Lc-u Altt Hl& lie Aiy Tyr Asp Glv 'Ihr 
1205 1210 121.S 

Trcp Af^i Ihr Fro ll€r Thr Asn Glu Dye lie Siex Lyf: 

1220 L225 1230 

Clu Lfu Ma Tj^^.'^ Ala Prv G]y I™ TCyr Cv^> Ala Oly ivr <5lJi 

1235 L240 m& 

«lv ulu Asp Ttr Txai TjcflL? Vfeil Met Phs '£yr Asn G£ln Gin Aap Thr Leu 
I2b0 125S 12e0 

Aw J^ftr Tyx L^s Ser lAet: CLn Gly iJeu Tyr lie Pte Ala Asp 

1270 1275 1220 

ttet GLii ^lyr liys asp Mat Ito Aep GLy Gin ivr i>/b £fer Tyr Ajng? Aap 
1265 1290 L295 

A^^ri fiftr lyr Ijye Gin ito AEp Thr Aen Sea: Arg Arg Aa^n 
1300 1305 131 D 

Ars Tyr Ma Glu Aep oyr Clti Sle Sear Ser v^l Asai Soar Ar*3 Law 
1315 1320 1325 

<2ly Tyr Asp Trp GSly Asp Tyr lyr beu Ser Met val Tyr Am Gly As^ 
1350 t2i3S 1340 

1L$ Thr lie Bier Tyr I^s JlLe 7*ir J>er Ser Lc« I^s He IVr 
345 1350 li65 136£> 

lie Sot F» J-yS lax Arrj He He His Aan Gly IVr Gly Gin Gin 
1365 1370 

Arrj Asn GQii Cyi? Asn Leu Met Aai Liys lyr Gly Leu Gly Aep Lys 
t3B0 li«S 1390 

JIk lie val '^yr Thr Ser l£u (Sly VaI. Asm Pn^ Asi^ A^fi an: Sekt Asn 
13S5 1400 1405 

L^i^: Met Fte Tyr Pj.ij Tfyr CLn 'Xyr Asn Gly Aon vaL Oer Gly 
1410 1^15 1420 

Leii Sbr Qln GUy Arg ijaiu Lefu HIb At:xj[ Asp ^Ttir Aa^ Tyr Seir- ^j- 
425 ld30 1«5 1440 

Lye val cslu ALst Trp lie Fro Gly Ala ely Aa^igf Rei- [(st? Thr Am Kro 
i^45 1450 145li 

AsTi AJl«i Ala Jltt <31y Aeip A^ Tyr Ala Tho: Aep 5er Leu Aa^ Ptn 
1460 14« 1470 

Asi^ Asp i£u l^ns fil3^ tVjc' Vhl IVr Met Ohr Asp Ser Lys Gly itir Ala 
1475 Id^b 1i4JSfb 

■3ta: ASP Vhl Ser Uily Pro Val Afsp iJjft a™ Hifr- Ala TU? Qt-j: Pto Ala 
1490 1495 L50O 

La^ V^l Gin Tlic VbLl 1^ Ala Gly Ser LyB cau Gin Thr Pl« Tte 
505 1510 15L?i 1520 

Ala Aep Lye Asn Val Ser IL& Glrk Pki> .Sf^^ PLt> Seo: Plu^ A«;]> Olu Met 
1525. 1530 1535 

Am 'lyr Gin Phz; A;.^fi Ai^i Leu C*lu He Asp Uly Ser aer U&a Asn Ph& 
1540 15» 15&D 



BNSDCXID: <WO 9942589A2Tl_> 



PCr/EP99rt>1015 

-61 - 



rtu Asn ten Sex Aia S&c lie Asp iLe Thr Hie tIti: Ala flie ALa Glu 
1555 ISSO 1565 

Gly Aug tj^^ i£u Gly Tyi: <5lu iJer feor lie m> He Ttir Arg 
IS70 1575 iSftft 

U'E vca Ser Thsr Ass¥> 7i«a S3er Leu Otir Leu Aarg Hia Asai fiiu Aijn Gly 
i^ab 15S0 1595 1600 

Ala GGltj lyr ttet GLri Ticp G>y V^il Ar^ IJi- Axg Itfu ten Ihir Leu 
ieD5 1610 leiS 

Pb3 Al« At« Gli^ Val Ala Arg Ala Tte Ttir dy He Asp Ttsj; He 
1620 1625 1630 

Leu ssx Met du Thr Gdii Asn Ilu Gin CIti Fro OLn Leo Giy Lya dy 
iKJb 1640 Iti^b 

ri^ 'Tyr ?±a Ite rtie vcil ll© ito 'lyr Aan Pro Ser Ttir )3is Gly 

1650 

7i£^ du Arg rnrp l^. leu Tyr lie l^^s Hi© val Val Asq> Afsri Asn 
'^115 1670 1675 IGSO 

Sax t-li^ He He Tyr S5ftr- cjly Qh-c Un*. it/r, 74S|> Tl^ ftsn lie Ser 'Jtor 
1S85 1690 269^ 

Thi' Leu Phe He Pro Usa Asp Aep Fro Leu Asn Gin A^ l^r- f^.T- 
170D 17Ql^ 171C> 

AiA i-^^Sf '^^tel lyr Met Thr l*rifi lys Sfer Pm> Ser Asp Gly JJir 'l'n> 
1715 1720 1725 

Itp Gly Hift l^te \fisi.L Aire tep As?p Tsy?.; cay Ilo "fhc He Aan 
1730 1735 1740 

Pix^ Lya 1^- He Ijbu Ihr Hie rtie Glu Ser val Aen v&l Leu A£n Asrj 
745 IVSn T755 1760 

Tilt fic-r safti- Glu Pm 1S6L A.';p Pl^ Stir Gly Ai^ Scx Le^ Tvo: Phe 
1765 1770 a>75 

■rrp GlTi Iflu Fm IVr TVr Ihr t^o >5et Ijeu val Ala Glii Arg Laj Leu 
I7B0 neb J 790 

liis du Gin abtj Aep Gl'j Ala J».Bn Arc< T^p 1^ Ly?3 Tyir v^l 

1795 isijo ■ ieo6 

£fer iTa Sdt aiy 'ftjx He Vai His Gly Gin ile Qln Am Tyr Glii Tr^j 
1?*10 i?J15 1930 

Vr-l. Arig PWj ijtr^o Clu Asp TTar Ser 'Jx-p Asn Ser Aep Pro ^^-^ 
B25 1330 l«is 1940 

Abp Ser \ftjl A£!3^ l^-o Asp Ala V*il Als GOj^ Asp Pro Mrtt. Tyr 
l^i^ 1950 ias5 

r^fl Viil Ser •mr Phe Met Ait^ T^ir- )-^-m Ar^P Tcii L;:u Ho Aln Arg ^ly 
1S60 1865 leva 

A^ His Ala lyr Ars Oln Ueru Glu Arg Asp Ttir Leu A«ai Gli) A^et 
1»?7S i^BO 1BB5 

Met Trp lyr Met dn AlQ Lea nift ltf»^ ijj^i Gly Asp ive Pro tyr Leu 
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ISdD 1B95 1300 

Pio Leu Sea: '3tir "Jhr iTp fisci tep I?tr* Arrj Low Asp Alca Ala Asp 
905 1.310 1915 1920 

Ilo Tbr TJtT- Glji fifesr Al& Set- Ser Ser lie lal Ala Leu Atg Gin 
1925 1»0 1335 

Sfiir Hir B^D Aid Leu leu Ser Lau Ara Siar Aln Asn Ihr Lssu 'ftir iis5> 
1^40 IShlB 1950 

Tjejij Pbe i£u PTD Gin lie Am GLu val Met Met Asn Tyr 'llrp CSln thr 
19SS 1960 

Leu Ala <aln Aii3 Vhi IVr Asn Leu Ans His ten Ltju Sex lie Asp Gly 
1970 l&eO 

Gin Pi-D Leu lyr Leu Fro lie ns/r aIb itor Fro Ala AEp Pro L^ ?Qa 
985 1990 195^ 200O 

Leu l/ya 6f>r Ala Ala Val Ala Thr Ser <3Lri Gly GJy C*ly l^'S; L™ Pro 
2005 2010 2015 

Glu Sex Ite WcH:. Ser LeLi Tlrp Arc Cl» Pro His Met Ejsti Glu Aen Ala 
2020 2t}2S If^iO 

Jpr?g Met val Ser l£u rrtir g1« Phe Giy S3er Li3M QIh Asn 
2^>35 2MD 2045 

lie lie G\m Arg CLn A'^ Alsi Ohi Ala Leu Asn Ala Leu Leu Gin itEn 
20&0 2055 2060 

GLn Ala Ala du Usa lie Leu Ohr Aan Leu He Cain Aep iVir 

06b 2070 2D7t 20UO 

lie GLu JjBU Asp Ala Glu Dy^ ahr JJaa C^Lu Lys Ser Lye Ala 
2065 209O 209S 

Gly Ala Gin Sax Arg Phe A5-p fSei^ ©tar l.yy ijt^i b3i« Aap Cdu Asn 
2H?0 2105 2110 

lie Asn Ala Cly Glu Aen t^ln Ala Het Itir Leu Ar^ Ala 3ex Ala Ala 
2115 2120 2125 

Gly Leu ite Ttu: Ala Gin Ala Ser Arg Leii Ala Oly A\ix Ala Ala 
213D 2135 2140 

A^£> T-ofti val Pro Aw lie rtx; CQy Pty:? Alft Olv Oly Gay iJer Aaog Trp 
115 2150 21E.5 71&0 

Gly Ala lie Ala Giu Ala Thcc Gly Pyar Val >fet Glu PJk; S^jr Alzi Aen 
2165 2170 21V^ 

Vftil tet Asn Thr Olu Ala Asp Lye He Ser Gin Ser Glu Tlir syx- Axg 
21 GO 7165 2190 

Arg Arg Arg Gin Glu TVp Glu Gin Aiy Asii Akti Ala Cllu Ala Glu 
2195 2200 2205 

Leu l;^ Glr2 Leu A?p Ala Gin Leu £^ Leu Ala V^l Arg Arg Glu 
2210 22iS 2220 

AJa Ala Val Leu Glii Lys tItt Ber Lys Tl^- oin Clti Glu Oln 'ftir 

225 2230 2235 2240 
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GlJi Ala Cln Lew Ala RmS; Ixxj Oln Arc Jjys rhe Ber Asn GOn Ala Leu 
22dS 2250 32^5 

Tyr A-'Dfi Tt:p Lea Arg Gly Jung JjrAi Ala Al« lie Tyr Ite <3}n the iVr 
22 60 22 6& 22V0 

Asp Leu Ala Ilo Alci Arg Cys Leu Het Ala Glu GLr\ ALa lyt Trii 
2;27B 226U 22B5 

lie Ser Abp Sor Ala Arg Phe lie Lyie Pro fily Al^a Tni <5iln 
22^0 22&S 2300 

Gly Ttir TVr Ala Gly l£u Leu Ala Gly G^u Tjcaji Wet Lbu Ser beu 
105 :4sio 2iiD :2:52i) 

Iter. GlIu Tt^^j Lnu Arr;? Axr; Aep fccq Ala Let) Glu 

2325 2:^30 2325 

veil Gl\i AL-g Thar tal 23er Ueu Al£ Glu lie Tyt* Al?^ Gly Lcut Oln 
234D 2345 23G0 

A^ LyB Gly Fro Fte Ser Leu Thr ain Clu II0 Clu 1^ Leu 1^1 Asn 
3355 23 GO 236b 

iMa GlY <31^ 3<sir GIv Asn P,sti Asm Leu Al^^ Qly 

2370 2S7S 23 BO 

Als Gly ihr AEp Thr l^ns Thr Sei^ Leu GLn ALa £ter lie Ser Leu JQfi 
3S5 2390 239& 240O 

Iis^ Zt-£H Lys He Ajce GlLi A^ 'Jyr 1?xto Glu Ser life Gly Lys 11© Aru 
2405 2dlO 241S 

Arg ll<* Ly« lie Sbt val Tftir liSftj Pro Ala l^ia Gly Xto lyr 
2^20 'JdZ5 2430 

CiiR Cln Al£t lie Leu sex Tyi' Aa^ Ly^ Ala cSly hsa Ala 

2435 2440 2ddS 

Asn Gly eye ALa ALa l-tfu ALa Vol £Ser Kis Gly Hkt Amtj A.op Sex Oly 
2450 2dSB 2^60 

can Rjsii Girt tep ite Asn AEp GLy Ly^ Ftw Lcfw ITo the GLu GLy 
465 2470 2d7B 24B0 

He Ala He Aep Gin Gly Tiir TJir laa Ser Phe Pro Aeo AUi ^Ictt 
24P5 2490 2495 

Ttac Pro Ala Lye GJy La^ Gin Ala ita- ttet L^i i^c; 1^ Uw Asn Aep 
2500 :i505 2S10 

ile He Leu rtis He Axg TVr TTir He Lys 
251^ 2520 



-:2iQ> Id 
^2H> 1481 

*:213:' itotortiabdiis luminef^j^^; 
<400> 14 

Met. Gin Am Ser OLn T!hr Ric Sax Met Ttir Glu Jj<-u Pro Lvs 

1 G 10 15 

Gly Gly G3y Ala He Thr GLv Met Gly Clu Ala beu Thr Pro Ala Gly 
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M 25 3D 

Pro Aep Gly Mf-t Rl a Mel Leu Ser Ubj Pro Leu Pro He ser Ala Gly 
35 40 45 

Anj Gly iVr Ala Pro Sea: Tin: Loct ten Tyr ^Ssn Ser Gly Ita* Giy 
50 55 61) 

Asi Ser fro E^:»e Gly r<aje Gly iTp Aep Cys Jisa V&L ifer. TOir lie Arg 

70 75 eO 

Ar^ Argj TShr j^r Thr val Eto Aen T^r <?lv Hit A-siP Thr Hsl^ 
SB 90 95 

C^y tPro Glu G3y Gl.n \'fea Jjbu val Vfei Ala Leu Asn Glu Ala Gly 
100 105 110 

OLn Ale Asp lie Arci Ssa.- CSj^ a:ar Sacr Leu Gin Gly He Aan Leu Gly 
11 120 12S 

Met Tto Pte Thr Wl Thr dy Tyr Arg Ser Arg l^j u Setr; Unr; Ptxi 
13n liiy 140 

ser Arg l>5(u Olu Oyj: ivp GLn Prt> gj.n Ihr Thr Gfcr Ala Itir Aed ste 
145 150 156 P^JU 

Tip Lou lie Tyr Ser Prv Agp Gly Gin Ala Ida Oy ly;:; Aini 

ie5 170 IIL 

RfD Glri A)a Aro lie Ser A£rt Prx? Lbls Asn Val Aen Gin Tl:ur Ala <SLri 
ia<i ISS 190 

Tip Leu Glu Ala 6cr %al Set Ser His Gly Olja OOn lie O^r Tyr 

50O 2QS 

Gin Tyr Artr Ala Glu A^ Glu Tlw Asp CVs <51u Thr Aro Giu I^eu Thr 
210 215 

Ala His Pro As^i Tlir Ohr V&l (^n Arg lyr Lsu Gin Vtil val ±iLB lyr 
:^30 2:^5 240 

CUy Aai Leu Ote*^ Ala Ser Giu V&l ItKj Pro Hit Lbu Abo Gay >^ Aep 

250 25=^ 

ETrt lj>fE: Ser Gly Trp Leu ite Qje Ijeii V&l Phe- Aav TVr Gly Glu 
250 265 2'?D 

Arg lJi?a i\£Xi Ssr Leu S3er Giu >teb Pro Pro Phe lys Ale rrtxr Ser Ah^ 
275 280 265 

lip Lieu cy^; Ar^ ij^ Asp Arg Pha Ser Arg 'lyr Glu Tyr Gly rtift Ale 
2M 200 

Leu Arg TYkt Arg Arg ]^ Cva Arp Gin Xlft Uyj Mt^l Ph;; Riy Aot Dcu 
305 310 315 ' %>.(^ 

Gin Hir JjSM £«ar dy Gin A)« liys GLy Aop Aep Glu Pr.o aIa l^^i I 
^25 330 325 

S*.x Arc Lieu He Leu Tyr Jisp <3lu Assjcj Ala Val val Ser Thr Leo 
340 345 350 

val ser val Am Anqr ^^l Gly Hj.s Oli* Gin Asp Gly itir Thi* Ala val 
355 260 3G5 
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ALa Lgsi Wro Ettd Gin Lew Ala lyr Gin Pro flte Pvts C3lo Glii 

:i'^0 17S 390 

Lys Ala lifrtj Tip An3 txo Met 7i£3£> Val UtJi Ma ten the Aa^ nir lie 
-^^0 MS 400 

Glr> Arg lip Gin Asp Lea ULn Gly Glu Gly Vftj I Prtji Gly lie 

^05 410 41& 

Leu lyr Gin A^ Asn CSlv "Etp Tip Tyr Atxj Sbr Ala GQn atr tSln 
45!0 42& d30 

tfa: Gly Glu <?iw Met Asn Ale VSal TJir Tip Gly Btet Gto lisu 
435 MO 44S 

i>rD lie Thr PKO Ala lie Glai A£?p ASirt Ala Stor Ijeu het Aep lie 
450 455 dfU) 

GOi^ A5P Oly GOn mi 'Ttp '^^1 lie Tla: Gly Pro (Sly Lbu Atr Gly 
465 <a7{) 475 

Tyr HiB Sex GJr) His Pro Asp Gly Tip 'Jhr Anj Bhe Thr 

d&S ^90 495 

Hi^ Aitt Ixa* lie &.Xi TVr Thr fiiB Pro Arg A1& GSlri Lpsu Ala Asp 
SOO 505 510 

l£u Jfiet Gly Ala Oly Ijea S3er ASp Leu \lt&l JJeu He C51y Prt> Ive; Sar 
515 520 525 

Atts 1^ oyr Ala Aan iisn tog Aep CQy Ite Thr C51u Gly Are Af3n 
530 535 iidO 

val din ©or cn.y c2Ly Il^e ITir Leu Pro lien fro CSly Ale Asip Alci 
5^5 550 555 560 

Jung lys; Lai V&L Ala Pho Ster Aap val Leu Oly Sbr dy Gin Ala His 
565 570 575 

Leu Vtil Glu VaL £ter AI3 Ttir Lys Vhl Tta: cys TCp Pro A£;n Lcxi (Sly 
580 5B5 

Ris Gly Arg E%ifr Gly Gin Pro Tie Thr Pro Cly Ete 43fer Qlis Ser 
595 60D 605 

Ala /iDa ten Ite As!*i Pro Agp Arg \te.l. His Ala Asp A^ Gly 
610 

Ser Gly Pro Ala Asp Laj Tie TVr His Ala Kl^:; Luu i^isp Tie 
625 630 635 640 

File sec ten CSI1J Ser Gly Ami CSly F^-^e Ala Gin Pro Ite ^hr Leu Arg 
64Fi 6f^f> 655 

Pro Asp Gly Uju Argr 1^ Ab|i a<;d Ttir Oys Gin Gin val Ala 
660 665 670 

A^ Vfll can C31y ijsiu Cly Vfal val £»; 1^ iJ.c Lc^^ Val Pro His 

6B0 fiSvS 

ttet Ala Pxtk His HLB TLp Arg CVs A^ L^ Thr Aeo Aid Ptd Tee? 
690 695 700 

T.eu Ser Glu Met tea A^an Aan t-fet Gly AIa His His Hir Le*i His 
705 7L0 TIS 
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'jyr to? Sbt Sset vbI GOn Hie Ttp Leu Asp Urlu Lye AU* ALsi Ala lieu 
72S 730 -?35 

Met TVir f?ly oin iiir E*ro V&L Oya lyr Let* \!to Bib I^ro Vel Rl& Thi* 
7^0 745 7S0 

Leu Trp <3[Ln Ttar cSlu TThr <3lu Asp Giu lie SSir aiy A5;r> Iy« Ijsii tol 
■JSS 760 765 

TJir Tlir i^ij Arg lyr ALci Mis Gdy Ala Trp AS!p Gly Arn Glui An? Oli;! 
770 T/S 7^iO 

iiie AiTt Gly Gly Tyr V&l cnAi Oji fflir ARp S3er Jds Glii 1^ Ala 
790 795 30n 

aia oriy AIg Pro Glu Atq -Thx ffer Paco Ala Leu itc: igys Asn Itp 

eio 

rcyr ALa Ihr C31y xle Pro <llv Val Aci> Tt^r* Trtij JSftir A]^ fil.y Tyr 

fJ^n eaEi 830 

Tap tog Gly Jisp 5f ir Olji Ala iS^ ita: (Sttv Vte 'ihr i>m bis l^ie Thr 

GdO £sdS 

Leu Ttii lys Glut tSly VI^lI Pm Leu Thr Pm Glij A^ His 

850 B55 8tD 

Tft) Tyr rcD Leu Asn Arg ALa Leu Lye Gly Gin Fro Leu Arg ser 
365 870 B75 SSD 

eiu LBU CSLy Aso Gly £ter Ala Gin can Lys ile ito 'lyx Ita 
BflB 630 $!>.S 

V<a.l Thf CSlu £ser Ang Pro GLn v&l Ajrg cajo 1*^1 Rdti A?.p A«2n Thr Thr 
&0L> 905 SIO 

LEU SET Pro 7;&1 TJcp Alei S©r Viil ^1 OLu Sfer Am £ser Tyr His 
915 92ft 9:^f> 

TVr Cau AT^ lie ll* Ser Pto G'Ln Cys Asn Cbi tey Ilo TTn- I^^sy 
930 935 

StjT £b!r Asp ijsf J Phe Gly Sin Pcx> Ltnji Iy& Gin Vai Scr V^l. Gin TV^ 
i'^^^ 950 95S 9&D 

Pro Jing Arg ftan Ivs Pro llir Thr Aesii Pro Tv^ Pro Asp TThr Leu Pro 
96?> 970 375 

Aop Thi- Le^j Pi\r- Ala Sor Ser Ciyr Asp Asp Glji tsOn Leu Leu Arg 
580 gs5 990 

1^ rphr Tyr Gin Oln iier Eer Trp laJf? m.j?: Tls Ala A.-5n Rdu T^i 
99s 1C»0 lOOS 

Arg V^l [/xi CSly Leu Pro Gly Ti^r Arg J5*^r' Ar.p AIa F*v; Thr; Tyr 
IQll'O 1016 l{i20 

Asp ALa Lv« His Vhl ITo Val AHp Gly Asii Leu Glu Ala tjeu cys 
025 1030 1035 1040 

Ala Glu Aan S&r- Leu lie Mi?. Asp Asp SiyB l^o Arg gLu lyr Leru Abr 
10^5 iOS^g 105!) 

Gin Arg Tte TVr Tte Afi?5> Cly Lys Thr Asp Gly ^ 
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•flir Rio Lfirti Lys Ttii- Pro ^ Arg tSln Ala Jjau He Ala Phe Thr SLu 
1076 1080 1085 

Ihr Ala val Leu Tlir GOii Scr Lbu Lew Seir Me* I^h Agp Gly Gly Il^t 
1 090 10§5 1101) 

Tl^r; Pn:> G-lu liBU »x> Gly Let] Lesfii ita- Clbi ALa C3ly Tyt Gin Gbi 
106 ill» ins 1120 

Gly Ifro TCVr Leo Fl:e Pro L«iJ ^-r Qly QIvl A-cti GIji Val Tip AL*i 
l:i25 1130 112!i 

Arg l^s Gly Tyr Thr Asp lyr Gly Thi- Glu wsl Gin TOp Arg Rro 

1U5 1150 

Val Ala GlxL Arig A.w Tlar c3in jjeo Ihr tSly 5hr Tta: Lea D/b Tip 
1155 11^) 

Aop Tin Hl& Tyr LVb V3al He Tlw.- aln Tl^r rtin A'^^j aIa Mj5* GV T^c^u 
1170 1175 1180 

•3hr VDl Ala Aen T^j:^ fer^ Tup Ang Ite Lbu 'itxc ftx> Met tiln ijeu 
iSS USD 11&5 

rrhr A^ rle Asn Asp Asa V&l eUb lie iLe Thr As-p Ala Oly 
1205 1210 121 S 

Ana iTo val Hir C^lii Arg rtie lYP tJly lie ulu Am Gly Ala Thr 
122C jSt^^ L230 

Gly T/r Sfer !5fcx l^o Glu Ala l^ws Pit^ i^jh -t^kt- Pn> Pit^ VUl Abi? Veil 
1235 L240 1245 

A?:n At« Ala lie Ala Thr' dy Pro Jj&c* toD val Ala ^Sln CVs Deo 
1250 L2IS5 ia<y:) 

Vtil lyr Ala Pro Atu 3er 'lip Met Pro Leu Phe <Sly C?ln C!lXJ Tl^r Phft 
^^^^ 1271) 1275 L2B0 

Asn Ihr Leu Thr aln Glu Glu GLn Lye T4ir Lea Arg Asp Leu Arg lift 
12S5 1290 1295 

11c Tte <Slii A^ Trp Jitg liB CV3 Ala Leu Ala Axg Arg Arg Tip Leu 
1300 1305 1310 

fibi 9er Glo l4fs Ala Gly Thr l=vx-, T<iu Vtd lys Lcti Lgu Ttr Asn Sor 
1315 U20 13^5 

lie -Gly Leij Pro Pro His Aen L£u Met i.w 7i1a Thr ASfj Arg TVr Asi> 
1330 1335 J.3dO 

Ajg ^^g> a?r Olu Gl3i C3an lie Ar^ <31ji GOji Vtil Ala the Ser Asp Gly 
1^50 1355 13e0 

rt^ft <3Jy Ary Leu Leu cSIn Ala Ala >tel Arg Ris Glu Ala Gly Olu Ala 
1165 a:57ll 1375^ 

i:icp Glji Arg ain Asp Gly Sei* Leu VaI •rttt^ i.yr, w-b <I>)v Aop ihi- 
1360 13 B5 1390 

IVB llu- tog TTi^ Al6 lie Thr Gly tog Thr Glu Tyr Asp Ann lys Gly 



1395 



14D0 



1^05 
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<3an Ala lie Arg Tlir Tyi- Gin Pro She Leu A^ri^ TVp Ara TVr 
1-410 14L5 1420 

S&r- As:-^? Asp Ssr Ala Arg Lye GLu Aia Tyr Ala Asp T3^ Hie rlns 
A25 lAso 143^ 1440 

lyr Asp l^tro lie Gly Ary CW|^ Dt^ CIji IfeQ Ilo Ohr Ala t=ay itp 
2445 1450 L4hS 

lisu Arip Gin ABO Qly^ TVr Fh© L>rtD 1^ i*is Thr V&l Eer C51u Aap 
1460 14fc;i> 1 470 

Asn Asp Leu SHK^ Al!^ Asp Alis ijeu val 
1475 l«W 



<21(>> IS 
<-211> 23 

<213c- Artificial ^iwHeo 
<22a. 

<22's> rj^sotriptiori of ArcificiaL £^u^5>c*?io:3neanurlOTtijde 
cg^tccga tgettttaaa «nc 23 



<210> IG 
<211> 16 



<220> 

<523> DsscriptiiXi of Arl:J.f.k:jAl .eeoijencHSDliOTiiJCleotide 

c4Q0> IC 



<210> 17 

<212^ rm 

<2:X3> Artificial Geqiieare^.'t 

-c221> DeEnS.ption of ArtlficlssL a*i^*q>«^ts:<>l.ig;Tn)LULaoc^ 
<^00> 1? 



<2LD?- IB 

>::2Ll:> 26 
<2I2> IS*!. 

^13> Art-iticial iieguam* 

<2;3n> 

<223> Deeolpticn of Artificial Sftjiiftryjerol.ljgiwxx^JijaM.i^h- 
oagcffagctc ttacttaatg fft^gCog 2e 



BNSDOCID: <WO 9942589A2TL> 



wo 99/42589 



-69- 



<21l> 2fi 



<22Q> 

<Z23> rescriptlan of Ai:ti£icl^ ^egijaxiasollQCAUieLeatijde 



<51£i> 2D 
12115- IB 

<2X3> ArLificial 



<220f> 

<223> Dsscrlptxcz) of Artificial £^9:juax3d:oli0cni;K7leDtijaa 

ggcBatggca gc^ta^ 16 



<21Ci> 21 
<212> 

<313> Artificial Seguencse 



<:220> 

-=i22i> Eescariptioffi of Artifijcial Saqjjei!oejoligcnuCl«Otlde 
<400> 21 

cat;c:Bacgaa ^gaai^gc 18 



<2103- 22 
<211> 2$ 
<212> CtnTl 

<213> Artificial ^Bqam:?^ 
<22Q> 

<222> tesdiption of JUrtificial ^^eguarsasoliQaniicLeotliSB 
<400> 22 

^cctcgagt taEsacg^gcg cgbcag 26 
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